DESCRIPTION
PROTEIN KINASES 

FIELD OF THE INVENTION 
The present invention relates to novel kinase polypeptides, nucleotide sequences 
encoding the novel kinase polypeptides, as well as various products and methods useful 
for the diagnosis and treatment of various kinase-related diseases and conditions.

BACKGROUND OF THE INVENTION 
The following description of the background of the invention is provided to aid in 
understanding the invention, but is not admitted to be or to describe prior art to the 
invention. 

Cellular signal transduction is a fundamental mechanism whereby external stimuli 
that regulate diverse cellular processes are relayed to the interior of cells. One of the key 
biochemical mechanisms of signal transduction involves the reversible phosphorylation of 
proteins, which enables regulation of the activity of mature proteins by altering their 
structure and function. 

Protein phosphorylation plays a pivotal role in biological signal transduction. 
Among the biological functions controlled by protein phosphorylation are the following:
cell division, differentiation and death (apoptosis); cell motility and cytoskeletal structure;
control of DNA replication, transcription, splicing and translation; protein translocation 
events from the endoplasmic reticulum and Golgi apparatus to the membrane and 
extracellular space; protein nuclear import and export; regulation of metabolic reactions, 
etc. Abnormal protein phosphorylation is widely recognized to be causally linked to the 
etiology of many diseases including cancer as well as immunologic, neuronal and 
metabolic disorders. 

The most common phospho-acceptor amino acid residues are serine, threonine and
tyrosine. Phosphorylation of histidine has also been observed in bacteria. The presence of
a phosphate moiety modulates protein function in multiple ways. A common mechanism
includes changes in the catalytic properties (Vmax and Km) of an enzyme, leading to its
activation or inactivation. A second widely recognized mechanism involves promoting
protein-protein interactions. An example of this is the tyrosine autophosphorylation of the
ligand-activated EGF receptor tyrosine kinase. This event triggers the high-affinity
binding of the phosphotyrosine residue on the receptor's C-terminal intracellular domain
to the SH2 motif of the adaptor molecule Grb2. Grb2 in turn binds through its SH3 motif
to a second adaptor molecule, such as SHC. The formation of this ternary complex
activates the signaling events that are responsible for the biological effects of EGF. Serine
and threonine phosphorylation events have also recently been recognized to exert their
biological function through protein-protein interaction events mediated by the
high-affinity binding of phosphoserine and phosphothreonine to WW motifs present in a
large variety of proteins (Lu, P.J. et al., 1999, Science 283:1325-1328). A third important
outcome of protein phosphorylation is changes in the subcellular localization of the
substrate. As an example, nuclear import and export events in a large diversity of proteins
are regulated by protein phosphorylation (Drier E.A. et al., 1999, Genes Dev. 13:556-568).
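
By way of illustration only, the effect of a change in catalytic properties can be sketched with the standard Michaelis-Menten rate law, v = Vmax[S]/(Km + [S]). The function name and all numeric values below are hypothetical and are not taken from any kinase described herein; the sketch merely assumes that phosphorylation lowers Km while leaving Vmax unchanged.

```python
def michaelis_menten_rate(substrate, v_max, k_m):
    """Initial reaction velocity v = Vmax * [S] / (Km + [S])."""
    if substrate < 0 or v_max < 0 or k_m <= 0:
        raise ValueError("substrate and v_max must be >= 0, and k_m must be > 0")
    return v_max * substrate / (k_m + substrate)


# Hypothetical kinetic constants (arbitrary units) before and after phosphorylation.
SUBSTRATE = 10.0
rate_before = michaelis_menten_rate(SUBSTRATE, v_max=100.0, k_m=40.0)
rate_after = michaelis_menten_rate(SUBSTRATE, v_max=100.0, k_m=10.0)

print(f"rate before: {rate_before}, rate after: {rate_after}")
```

Under these assumed values a lower Km yields a higher rate at the same substrate concentration, which corresponds to activation of the enzyme; raising Km or lowering Vmax would correspond to inactivation.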

Protein kinases are one of the largest families of eukaryotic proteins with several
hundred known members. These proteins share a 250-300 amino acid domain that can be
subdivided into 12 distinct subdomains that compose the common catalytic core structure.
These conserved protein motifs have recently been exploited using PCR-based and
bioinformatic strategies leading to a significant expansion of the known kinases. Multiple
alignment of the sequences in the catalytic domain of protein kinases and subsequent
parsimony analysis permits their segregation into a dendrogram reflecting the relatedness
of their catalytic domains (Fig. 1). In this manner, related kinases are clustered into
distinct branches or subfamilies including: tyrosine kinases, cyclic-nucleotide-dependent
kinases, calcium/calmodulin kinases, cyclin-dependent kinases and MAP-kinases,
serine-threonine kinase receptors, and several other less defined subfamilies.
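
By way of illustration only, the scoring step of a parsimony analysis can be sketched with Fitch's algorithm, which counts the minimum number of residue substitutions that a given tree requires to explain an alignment. The particular parsimony method used to build the dendrogram is not specified in this description, so Fitch's algorithm is an assumed stand-in; the kinase names, the short aligned fragments and the two candidate trees are hypothetical.

```python
def fitch_site(tree, residues):
    """Return (possible ancestral residues, minimum substitutions) for one column.

    `tree` is either a leaf name (str) or a pair (left_subtree, right_subtree).
    `residues` maps each leaf name to its residue in the current column.
    """
    if isinstance(tree, str):  # leaf: its own residue, no substitutions needed
        return {residues[tree]}, 0
    left, right = tree
    left_states, left_cost = fitch_site(left, residues)
    right_states, right_cost = fitch_site(right, residues)
    shared = left_states & right_states
    if shared:  # children can agree, so no extra substitution at this node
        return shared, left_cost + right_cost
    return left_states | right_states, left_cost + right_cost + 1


def parsimony_score(tree, alignment):
    """Sum the Fitch substitution counts over every column of the alignment."""
    lengths = {len(sequence) for sequence in alignment.values()}
    if len(lengths) != 1:
        raise ValueError("aligned sequences must all have the same length")
    total = 0
    for column in range(lengths.pop()):
        residues = {name: sequence[column] for name, sequence in alignment.items()}
        total += fitch_site(tree, residues)[1]
    return total


# Hypothetical aligned catalytic-domain fragments.
ALIGNMENT = {"kinA": "GEGAYG", "kinB": "GEGSYG", "kinC": "GKGTFG", "kinD": "GKGSFG"}
TREE_ONE = (("kinA", "kinB"), ("kinC", "kinD"))
TREE_TWO = (("kinA", "kinC"), ("kinB", "kinD"))

score_one = parsimony_score(TREE_ONE, ALIGNMENT)
score_two = parsimony_score(TREE_TWO, ALIGNMENT)
print(score_one, score_two)
```

The tree with the lower score is the more parsimonious grouping. A full analysis would also search over tree topologies, which this sketch does not attempt.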

We have recently completed a systematic analysis of the protein kinases present in
C. elegans, the multicellular organism whose entire DNA sequence has been determined.
We identified 473 unique kinase profiles including 398 full-length conventional kinases,
and 20 additional proteins that may function as atypical protein kinases (Plowman G.D.
et al. (1999) Proc. Natl. Acad. Sci. 96:13603-13610).

Using parsimony analysis, the protein kinases may be divided into 4 major groups:
AGC, CAMK, CMGC and tyrosine kinases. In addition, there are a number of minor yet
distinct families, including the STE and casein kinase 1 families, families related to worm- or
fungal-specific kinases, and a family designated "other" to represent several smaller
families. In addition, we designate an "atypical" family to represent protein kinases whose
catalytic domain has little or no primary sequence homology to conventional kinases,
including the A6 kinases and PI3 kinases.

The AGC kinases are basic amino acid-directed enzymes that phosphorylate 
residues found proximal to Arg and Lys. Examples of this group are the cyclic nucleotide- 
dependent kinases, G protein kinases, NDR or DBF2 and the ribosomal S6 kinases. 

The CAMK group kinases are also basic amino acid-directed kinases. They include
the Ca2+/calmodulin-regulated and AMP-dependent protein kinases, myosin light chain
kinases, checkpoint 2 kinases (CHK2) and EMK-related protein kinases. The EMK family
of STKs is involved in the control of cell polarity, microtubule stability and cancer. One
member of the EMK family, C-TAK1, has been reported to control entry into mitosis by
activating Cdc25C which in turn dephosphorylates Cdc2.

CMGC group kinases are "proline-directed" enzymes phosphorylating residues
that exist in a proline-rich context. They include the cyclin-dependent kinases (CDKs),
mitogen-activated kinases (MAPKs), GSK3s and CLKs. Most CMGC kinases have
larger-than-average kinase domains owing to the presence of insertions within subdomains
X and XI.

The tyrosine kinase group encompasses both cytoplasmic (e.g., src) as well as
transmembrane receptor tyrosine kinases (e.g., EGF receptor). These kinases play a pivotal
role in the signal transduction processes that mediate cell proliferation, differentiation and
apoptosis.

Group members that define smaller, yet distinct phylogenetic branches of
conventional kinases include the elongation factor 2 kinases (EIFKs); homologues of the
yeast sterile family kinases (STE), which refers to 3 classes of kinases which lie
sequentially upstream of the MAPKs; mixed lineage kinases (MLKs); Lim-domain
containing kinases (LIMKs); Calcium-calmodulin kinase kinases (CAMKK); dual-specific
tyrosine kinases (DYRK); integrin receptor associated kinase (IRAK); testis-specific
kinases (TSK); UNC-51 related kinases (UNC); several families that are close
homologues to worm (C26C2.1, YQ09, ZC581.9, YFL033c, C24A1.3), Drosophila
(SLOB), or yeast (YDOD_sp, YGR262_sc) kinases; and others that are "unique" and
don't cluster into any obvious family.

SUMMARY OF THE INVENTION
Through a search of the [illegible] database for homologues to the conserved catalytic
kinase domain of protein kinases, hundreds of mammalian members of known and
previously unidentified protein kinase families and groups have been identified as part of
the present invention. Multiple alignment and parsimony analysis of the catalytic domain
reveals that approximately half of these protein kinases cluster into 10 known groups, with
the other half perhaps defining novel groups. Classification in this manner has proven
highly accurate not only in predicting motifs present in the remaining non-catalytic portion
of each protein, but also in their regulation, substrates, and signaling pathways. The
present invention includes the partial or complete sequence of new protein kinases, their
classification, predicted or deduced protein structure, and a strategy for elucidating their
biologic and therapeutic relevance.

Thus a first aspect of the invention features an isolated, enriched, or purified
nucleic acid molecule encoding a kinase polypeptide selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242.

By "isolated" in reference to nucleic acid is meant a polymer of nucleotides 
conjugated to each other, including DNA and RNA, that is isolated from a natural source 
or that is synthesized. The isolated nucleic acid of the present invention is unique in the 
sense that it is not found in a pure or separated state in nature. Use of the term "isolated" 
indicates that a naturally occurring sequence has been removed from its normal cellular 
(i.e., chromosomal) environment. Thus, the sequence may be in a cell-free solution or 
placed in a different cellular environment. The term does not imply that the sequence is 
the only nucleotide chain present, but that it is essentially free (at least about 90-95%
pure) of non-nucleotide material naturally associated with it, and thus is distinguished
from isolated chromosomes. 

By the use of the term "enriched" in reference to nucleic acid is meant that the 
specific DNA or RNA sequence constitutes a significantly higher fraction (2-5 fold) of 
the total DNA or RNA present in the cells or solution of interest than in normal or 
diseased cells or in the cells from which the sequence was taken. This could be caused by
a person by preferential reduction in the amount of other DNA or RNA present, or by a 
preferential increase in the amount of the specific DNA or RNA sequence, or by a 
combination of the two. However, it should be noted that enriched does not imply that 
there are no other DNA or RNA sequences present, just that the relative amount of the 
sequence of interest has been significantly increased. The term "significant" is used to 
indicate that the level of increase is useful to the person making such an increase, and 
generally means an increase relative to other nucleic acids of at least about 2-fold, more
preferably at least 5- to 10-fold or even more. The term also does not imply that there is no
DNA or RNA from other sources. The other source DNA may, for example, comprise
DNA from a yeast or bacterial genome, or a cloning vector such as pUC19. This term
distinguishes from naturally occurring events, such as viral infection, or tumor-type
growths, in which the level of one mRNA may be naturally increased relative to other
species of mRNA. That is, the term is meant to cover only those situations in which a 
person has intervened to elevate the proportion of the desired nucleic acid. 
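
By way of illustration only, the fold increase referred to above can be computed as the ratio of the fraction of total nucleic acid represented by the sequence of interest after intervention to that fraction before intervention. The function names, the threshold labels and the example amounts are hypothetical. The sketch addresses only the arithmetic and cannot establish that the increase resulted from human intervention, as the definition requires.

```python
def fold_enrichment(target_before, total_before, target_after, total_after):
    """Fold change in the target's share of total nucleic acid (after / before)."""
    amounts = (target_before, total_before, target_after, total_after)
    if any(amount <= 0 for amount in amounts):
        raise ValueError("all amounts must be positive")
    if target_before > total_before or target_after > total_after:
        raise ValueError("target amount cannot exceed the total amount")
    # Equivalent to (target_after / total_after) / (target_before / total_before).
    return (target_after * total_before) / (total_after * target_before)


def enrichment_level(fold):
    """Map a fold value onto the thresholds named in the text."""
    if fold >= 5:
        return "more preferred (at least 5-fold)"
    if fold >= 2:
        return "enriched (at least about 2-fold)"
    return "not enriched"


# Hypothetical amounts in nanograms: 1 ng of target in 1000 ng total before
# intervention, and 1 ng of target in 100 ng total afterwards.
example_fold = fold_enrichment(1.0, 1000.0, 1.0, 100.0)
print(example_fold, enrichment_level(example_fold))
```

In this assumed example the target amount is unchanged and the enrichment arises entirely from a preferential reduction in other nucleic acid, which is one of the routes described above.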

It is also advantageous for some purposes that a nucleotide sequence be in purified
form. The term "purified" in reference to nucleic acid does not require absolute purity
(such as a homogeneous preparation). Instead, it represents an indication that the
sequence is relatively more pure than in the natural environment (compared to the natural
level this level should be at least 2-5 fold greater, e.g., in terms of mg/mL). Individual
clones isolated from a cDNA library may be purified to electrophoretic homogeneity. The
claimed DNA molecules obtained from these clones could be obtained directly from total
DNA or from total RNA. The cDNA clones are not naturally occurring, but rather are
preferably obtained via manipulation of a partially purified naturally occurring substance
(messenger RNA). The construction of a cDNA library from mRNA involves the creation
of a synthetic substance (cDNA) and pure individual cDNA clones can be isolated from
the synthetic library by clonal selection of the cells carrying the cDNA library. Thus, the
process which includes the construction of a cDNA library from mRNA and isolation of
distinct cDNA clones yields an approximately 10^6-fold purification of the native message.
Thus, purification of at least one order of magnitude, preferably two or three orders, and
more preferably four or five orders of magnitude is expressly contemplated.

By a "kinase polypeptide" is meant 10 (preferably 20, more preferably 40, most
preferably 75) or more contiguous amino acids set forth in an amino acid sequence
selected from the group consisting of those set forth in SEQ ID NO:122, SEQ ID NO:123,
SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128,
SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133,
SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138,
SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143,
SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148,
SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153,
SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158,
SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163,
SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168,
SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173,
SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178,
SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID NO:183,
SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID NO:188,
SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193,
SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198,
SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203,
SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208,
SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID NO:213,
SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218,
SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223,
SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228,
SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233,
SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238,
SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242, or functional
derivatives thereof as described herein. For sequences for which the full-length sequence
is not given, the remaining sequences can be determined using methods well-known to
those in the art and are intended to be included in the invention. In certain aspects,
polypeptides of 100, 200, 300 or more amino acids are preferred. The kinase polypeptide
can be encoded by a full-length nucleic acid sequence or any portion of the full-length
nucleic acid sequence, so long as a functional activity of the polypeptide is retained. By
"functional" domain is meant any region of the polypeptide that may play a regulatory or
catalytic role as predicted from amino acid sequence homology to other proteins or by the
presence of amino acid sequences that may give rise to specific structural conformations
(i.e., coiled-coils). For some purposes, polypeptide domains are preferred, including, but
not limited to, N-terminal, catalytic/kinase and C-terminal.
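
By way of illustration only, the length criterion in the above definition can be checked by finding the longest run of contiguous amino acids that a candidate polypeptide shares with a reference sequence. The function names and both sequences below are hypothetical; the reference is not one of the sequences of the invention. The sketch tests only the contiguous-length criterion and says nothing about retained functional activity.

```python
def longest_shared_run(candidate, reference):
    """Length of the longest stretch of contiguous residues present in both sequences."""
    best = 0
    previous = [0] * (len(reference) + 1)
    for residue_a in candidate:
        current = [0] * (len(reference) + 1)
        for j, residue_b in enumerate(reference, start=1):
            if residue_a == residue_b:
                # Extend the run that ended at the previous residue of both sequences.
                current[j] = previous[j - 1] + 1
                best = max(best, current[j])
        previous = current
    return best


def meets_length_criterion(candidate, reference, min_contiguous=10):
    """True if at least `min_contiguous` contiguous reference residues are present."""
    return longest_shared_run(candidate, reference) >= min_contiguous


# Hypothetical sequences in one-letter amino acid code.
REFERENCE = "MKTLLVAGGSGKVYRAEHK"
LONG_FRAGMENT = "QQGGSGKVYRAEHQQ"
SHORT_FRAGMENT = "GSGKV"

print(longest_shared_run(LONG_FRAGMENT, REFERENCE))
print(meets_length_criterion(SHORT_FRAGMENT, REFERENCE))
```

The default threshold of 10 corresponds to the broadest case in the definition; passing `min_contiguous=20`, `40` or `75` applies the preferred, more preferred and most preferred cases respectively.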

The amino acid sequence will be substantially similar to a sequence selected from
the group consisting of those set forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID
NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID
NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID
NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID
NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID
NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID
NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID
NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID
NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID
NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID
NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID
NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID
NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ ID
NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID
NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ ID
NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID
NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID
NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID
NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ ID
NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID
NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID
NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID
NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID
NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID
NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242, or the corresponding
full-length amino acid sequence, or fragments thereof. A sequence that is substantially
similar to a sequence selected from the group consisting of those set forth in SEQ ID
NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID
NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID
NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID
NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID
NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID
NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID
NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID
NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID
NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID
NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID
NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID
NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID
NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID
NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID
NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID
NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID
NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID
NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID
NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID
NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID
NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID
NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ
ID NO:242 will have at least 75% identity (preferably 90%, more preferably at least 95%
and most preferably 99-100%) to a sequence selected from the group consisting of those
set forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ
ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ
ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ
ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ
ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ
ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ
ID NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ
ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ
ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ
ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ
ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ
ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ
ID NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ
ID NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ
ID NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ
ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ
ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ
ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ
ID NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ
ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ
ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ
ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ
ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ
ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ
ID NO:241, and SEQ ID NO:242 or portions of or the entire corresponding full-length
amino acid sequences.

By "identity" is meant a property of sequences that measures their similarity or
relationship. Identity is measured by dividing the number of identical residues between
two sequences (either full-length or a defined domain) by the total number of residues in
the known sequence, or the domain of the known sequence, and multiplying the quotient
by 100. Thus, two copies of exactly the same sequence have 100% identity, but sequences
that are less highly conserved, and have replacements and substitutions, have a lower
degree of identity. "Gaps" are spaces in an alignment that can result from aligning a novel
sequence with a known sequence when the novel sequence has additions or deletions of
amino acids in comparison with the known sequence. These gaps do not factor into the
assessment of % identity using the above calculation.
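
By way of illustration only, the identity calculation defined above can be sketched as follows for two sequences that have already been aligned. The function name, the gap character "-" and the example fragments are hypothetical. The statement that gaps do not factor into the assessment is read here as meaning that a gap position is never counted as identical and that gap characters in the known sequence do not add to the residue count; that reading is an assumption.

```python
def percent_identity(aligned_novel, aligned_known, gap="-"):
    """Identical residues divided by residues in the known sequence, times 100."""
    if len(aligned_novel) != len(aligned_known):
        raise ValueError("aligned sequences must have the same length")
    # Residues of the known sequence, ignoring any gap characters inserted into it.
    known_length = sum(1 for residue in aligned_known if residue != gap)
    if known_length == 0:
        raise ValueError("known sequence contains no residues")
    identical = sum(
        1
        for novel, known in zip(aligned_novel, aligned_known)
        if novel == known and novel != gap
    )
    return 100.0 * identical / known_length


# Hypothetical aligned fragments: one substitution and one deletion in the novel sequence.
KNOWN = "GEGAYGKV"
NOVEL = "GEGS-GKV"

identity = percent_identity(NOVEL, KNOWN)
print(identity)
```

With these assumed fragments, 6 of the 8 residues of the known sequence are matched, giving 75% identity, which sits at the lower bound recited for a substantially similar sequence.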

Those skilled in the art will recognize that several computer programs are also
available for determining sequence identity using standard parameters, for example, Blast
(Altschul, et al. (1997) Nucleic Acids Res. 25:3389-3402), Blast2 (Altschul, et al. (1990)
J. Mol. Biol. 215:403-410), and Smith-Waterman (Smith, et al. (1981) J. Mol. Biol.
147:195-197).
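
By way of illustration only, the Smith-Waterman local alignment method named above can be sketched as a dynamic-programming score calculation. The scoring values (match +2, mismatch -1, linear gap penalty -2), the function name and the example sequences are assumptions chosen for brevity; they are not the standard parameters of any of the cited programs, which typically use an amino acid substitution matrix and affine gap penalties.

```python
def smith_waterman_score(seq_a, seq_b, match=2, mismatch=-1, gap=-2):
    """Best local alignment score between two sequences (linear gap penalty)."""
    previous = [0] * (len(seq_b) + 1)
    best = 0
    for residue_a in seq_a:
        current = [0]
        for j, residue_b in enumerate(seq_b, start=1):
            diagonal = previous[j - 1] + (match if residue_a == residue_b else mismatch)
            # A local alignment may restart anywhere, so scores never drop below zero.
            score = max(0, diagonal, previous[j] + gap, current[j - 1] + gap)
            current.append(score)
            best = max(best, score)
        previous = current
    return best


# Hypothetical sequences sharing the five-residue stretch "HRDLK".
shared_core_score = smith_waterman_score("TTHRDLKAA", "GGHRDLKCC")
self_score = smith_waterman_score("GEGAYGKV", "GEGAYGKV")
print(shared_core_score, self_score)
```

The sketch returns only the score of the best local alignment. Recovering the aligned residues themselves, from which identity could then be calculated, would additionally require a traceback through the score matrix.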

In preferred embodiments, the invention features isolated, enriched, or purified
nucleic acid molecules encoding a kinase polypeptide comprising a nucleotide sequence
that: (a) encodes a polypeptide having an amino acid sequence selected from the group
consisting of those set forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ
ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ
ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ
ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ
ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ
ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ
ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ
ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ
ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ
ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ
ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ
ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ
ID NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ
ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ
ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ
ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ
ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ
ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ
ID NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ
ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ
ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ
ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ
ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ
ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ
ID NO:240, SEQ ID NO:241, and SEQ ID NO:242, or the corresponding full-length
amino acid sequence, or fragments thereof. A sequence that is substantially similar to a
sequence selected from the group consisting of those set forth in SEQ ID NO:122, SEQ ID
NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID
NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID
NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID
NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID
NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID
NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID
NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID
NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID
NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID
NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID
NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID
NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID
NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID
NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID
NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID
NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID
NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID
NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID
NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID
NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID
NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID
NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID
NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID
NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242 will
have at least 75% identity (preferably 90%, more preferably at least 95% and most
preferably 99-100%) to the sequence selected from the group consisting of those set forth
in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID
NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID
NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID
NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID
NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID
NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID
NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID
NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID
NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID
NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID
NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID
NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID
NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID
NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID
NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID
NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID
NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID
NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID
NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID
NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID
NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID
NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID
NO:231, SEQ ID NO:232. SEQ ID NO.233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID 
NO:236, SEQ ID NO.237, SEQ ED NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID 
NO:241, and SEQ ID NO:242; (b) is the complement of the nucleotide sequence of (a); (c) 
hybridizes under highly stringent conditions to the nucleotide molecule of (a) and encodes 
a naturally occurring kinase polypeptide; (d) encodes a kinase polypeptide having an 
amino acid sequence selected from the group consisting of those set forth in SEQ ID 
NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID
NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID
NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID
NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID
NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID
NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID
NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID
NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID
NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID
NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID
NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID
NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID
NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID
NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID
NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID
NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID
NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID
NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID
NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID
NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID
NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID
NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ
ID NO:242, or the corresponding full-length amino acid sequence, or fragments thereof. 
A sequence that is substantially similar to a sequence selected from the group consisting of 
those set forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125,
SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130,
SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135,
SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140,
SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145,
SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150,
SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155,
SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160,
SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165,
SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170,
SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175,
SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180,
SEQ ID NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185,
SEQ ID NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190,
SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195,
SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200,
SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205,
SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210,
SEQ ID NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215,
SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220,
SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225,
SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230,
SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235,
SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240,
SEQ ID NO:241, and SEQ ID NO:242 will have at least 75% identity (preferably 90%,
more preferably at least 95% and most preferably 99-100%) to the sequence of SEQ ID 
NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID
NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID
NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID
NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID
NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID
NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID
NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID
NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID
NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID
NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID
NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID
NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID
NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID
NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID
NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID
NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID
NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID
NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID
NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID
NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID
NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID
NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ
ID NO:242, except that it lacks one or more, but not all, of a domain selected from the 
group consisting of an N-terminal domain, a catalytic domain, a C-terminal domain, a 
coiled-coil structure region, a proline-rich region, a spacer region, an insert, and a
C-terminal tail; (e) is the complement of the nucleotide sequence of (d); (f) encodes a
polypeptide having an amino acid sequence selected from the group consisting of those set 
forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID
NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID
NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID
SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167,
SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172,
SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177,
SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182,
SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187,
SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199,
SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197,
SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202,
SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207,
SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212,
SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217,
SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222,
SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227,
SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232,
SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237,
SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242,
or the corresponding full-length amino acid sequence, or
fragments thereof. (The domain demarcations of the polypeptides of the invention are 
indicated in Table 2 by reference to the kinase domain.) A sequence that is substantially 
similar to a sequence selected from the group consisting of those set forth in SEQ ID 
NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID
NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID
NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID
NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID
NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID
NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID
NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID
NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID
NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID
NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID
NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID
NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID
NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID
NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID
NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID
NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID
NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID
NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID
NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID
NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID
NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID
NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ
ID NO:242 will have at least 75% identity (preferably 90%, more preferably at least 95% 
and most preferably 99-100%) to the sequence selected from the group consisting of those 
set forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ
ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ
ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ
ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ
ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ
ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ
ID NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ
ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ
ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ
ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ
ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ
ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ
ID NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ
ID NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ
ID NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ
ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ
ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ
ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ
ID NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ
ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ
ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ
ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ
ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ
ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ
ID NO:241, and SEQ ID NO:242; (b) is the complement of the nucleotide sequence of (a);
(c) hybridizes under highly stringent conditions to the nucleotide molecule of (a) and
encodes a naturally occurring kinase polypeptide; (d) encodes a kinase polypeptide having
an amino acid sequence selected from the group consisting of those set forth in SEQ ID 
NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID
NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID
NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID
NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID
NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID
NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID
NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID
NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID
NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID
NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID
NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID
NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID
NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID
NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID
NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID
NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID
NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID
NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID
NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID
NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID
NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID
NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ
ID NO:242, or the corresponding full-length amino acid sequence, or fragments thereof. 
A sequence that is substantially similar to a sequence selected from the group consisting of 
those set forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125,
SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130,
SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135,
SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140,
SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145,
SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150,
SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155,
SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160,
SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165,
SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170,
SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175,
SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180,
SEQ ID NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185,
SEQ ID NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190,
SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195,
SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200,
SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205,
SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210,
SEQ ID NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215,
SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220,
SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225,
SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230,
SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235,
SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240,
SEQ ID NO:241, and SEQ ID NO:242 will have at least 75% identity (preferably 90%,
more preferably at least 95% and most preferably 99-100%) to a domain of a polypeptide
selected from the group consisting of those set forth in SEQ ID NO:122, SEQ ID NO:123,
SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128,
SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133,
SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138,
SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143,
SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148,
SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153,
SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158,
SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163,
SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168,
SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173,
SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178,
SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID NO:183,
SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID NO:188,
SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193,
SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198,
SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203,
SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208,
SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID NO:213,
SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218,
SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223,
SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228,
SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233,
SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238,
SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242, where the
domain is selected from the group consisting of an N-terminal domain, a catalytic domain,
a C-terminal domain, a coiled-coil structure region, a proline-rich region, a spacer region,
an insert, and a C-terminal tail; (g) is the complement of the nucleotide sequence of (f); (h) 
encodes a polypeptide having an amino acid sequence selected from the group consisting 



WO 00/73469 



PCT/US00/ 14842 



of those set forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID
NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID
NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID
NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID
NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID
NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID
NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID
NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID
NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID
NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID
NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID
NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID
NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID
NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID
NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID
NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID
NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID
NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID
NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID
NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID
NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID
NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID
NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID
NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID
NO:240, SEQ ID NO:241, and SEQ ID NO:242, or the corresponding full-length amino
acid sequence, or fragments thereof. A sequence that is substantially similar to a sequence
selected from the group consisting of those set forth in SEQ ID NO:122, SEQ ID NO:123,
SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128,
SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133,
SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138,
SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143,
SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148,
SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153,
SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158,
SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163,
SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168,
SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173,
SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178,
SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID NO:183,
SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID NO:188,
SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193,
SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198,
SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203,
SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208,
SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID NO:213,
SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218,
SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223,
SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228,
SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233,
SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238,
SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242 will have at
least 75% identity (preferably 90%, more preferably at least 95% and most preferably 99- 
100%) to the sequence selected from the group consisting of those set forth in SEQ ID 
NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID
NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID
NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID
NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID
NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID
NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID
NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID
NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID
NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID
NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID
NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID
NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID
NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID
NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID
NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID
NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID
NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID
NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID
NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID
NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID
NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID
NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ
ID NO:242; (b) is the complement of the nucleotide sequence of (a); (c) hybridizes under 
highly stringent conditions to the nucleotide molecule of (a) and encodes a naturally 
occurring kinase polypeptide; (d) encodes a kinase polypeptide having an amino acid 
sequence selected from the group consisting of those set forth in SEQ ID NO:122, SEQ ID
NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID
NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID
NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID
NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID
NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID
NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID
NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID
NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID
NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID
NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID
NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID
NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID
NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID
NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID
NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID
NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID
NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID
NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID
NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID
NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID 
NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID 
NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID 
NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID 
NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242, or the 
corresponding full-length amino acid sequence, or fragments thereof. A sequence that is
substantially similar to a sequence selected from the group consisting of those set forth in
SEQ ID [...] SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID [...]
SEQ ID NO:134, SEQ ID NO:135, SEQ ID [...] SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157,
SEQ ID NO:158, SEQ ID [...]
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242 will have at least 75% identity (preferably 90%, more preferably at
least 95% and most preferably 99-100%) to the sequence of SEQ ID NO:122, SEQ ID
NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID
NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID
NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID
NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID
NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID
NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID
NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID
NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID
NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID
NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID
NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID
NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID
NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID
NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID
NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID
NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID
NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID
NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID
NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID
NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID
NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID
NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID
NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID
NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242,
except that it lacks one or more of the domains selected from the group consisting of an
N-terminal domain, a catalytic domain, a C-terminal domain, a coiled-coil structure region, a
proline-rich region, a spacer region, an insert, and a C-terminal tail; or (i) is the
complement of the nucleotide sequence of (h). The domain demarcations of the
polypeptides of the invention are indicated in Table 2 by reference to the kinase domain. 

The term "complement" refers to two nucleotides that can form multiple favorable 
interactions with one another. For example, adenine is complementary to thymine as they 
can form two hydrogen bonds. Similarly, guanine and cytosine are complementary since 
they can form three hydrogen bonds. A nucleotide sequence is the complement of another
nucleotide sequence if all of the nucleotides of the first sequence are complementary to all 
of the nucleotides of the second sequence. 
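
The base-pairing rule in this definition can be illustrated with a short Python sketch. It is an illustration only: the function names `reverse_complement` and `is_complement` are hypothetical, and the check assumes the usual antiparallel convention, in which both strands are written 5' to 3' so that the second strand is read in reverse. That convention is an assumption and is not stated in the definition above.

```python
# Watson-Crick pairs for DNA: A-T (two hydrogen bonds), G-C (three hydrogen bonds).
PAIRS = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq: str) -> str:
    """Return the antiparallel complement of a DNA sequence."""
    return "".join(PAIRS[base] for base in reversed(seq.upper()))


def is_complement(first: str, second: str) -> bool:
    """True if every nucleotide of `first` pairs with the facing nucleotide of `second`.

    Assumption: both strands are written 5'->3', so `second` is read in reverse.
    """
    return len(first) == len(second) and reverse_complement(first) == second.upper()
```

A sequence containing any base outside A, C, G, and T raises a `KeyError` in this sketch; handling of ambiguity codes or RNA is deliberately left out.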

The term "domain" refers to a region of a polypeptide that contains a particular 
function. For instance, N-terminal or C-terminal domains of signal transduction proteins 
can serve functions including, but not limited to, binding molecules that localize the signal 
transduction molecule to different regions of the cell or binding other signaling molecules 
directly responsible for propagating a particular cellular signal. Some domains can be 
expressed separately from the rest of the protein and function by themselves, while others 
must remain part of the intact protein to retain function. The latter are termed functional 
regions of proteins and also relate to domains. 

The term "N-terminal domain" refers to the extracatalytic region located between 
the initiator methionine and the catalytic domain of the protein kinase. The N-terminal 
domain can be identified following a Smith-Waterman alignment of the protein sequence 
against the non-redundant protein database to define the N-terminal boundary of the 
catalytic domain. Depending on its length, the N-terminal domain may or may not play a 
regulatory role in kinase function. An example of a protein kinase whose N-terminal 
domain has been shown to play a regulatory role is PAK65, which contains a CRIB motif 
used for Cdc42 and rac binding (Burbelo, P.D. et al. (1995) J. Biol. Chem. 270, 29071- 
29074). The N-terminal domain of a protein kinase of the invention is that portion of the 
protein kinase to the amino-terminal side of the kinase domain where the kinase domain is 
identified in Table 2, herein. Further, in some cases, portions of the N-terminal domains 
of the protein kinases of the invention have not been identified since the entire sequence is
not available. However, with the methods described herein, the full-length sequences of 
the kinases of the invention can be determined and using the approaches described herein 
the N-terminal domain can be identified.

The term "catalytic domain" or "kinase domain" refers to a region of the protein
kinase that is typically 25-300 amino acids long and is responsible for carrying out the
phosphate transfer reaction from a high-energy phosphate donor molecule such as ATP or
GTP to itself (autophosphorylation) or to other proteins (exogenous phosphorylation).
The catalytic domain of protein kinases is made up of 12 subdomains that contain highly
conserved amino acid residues, and are responsible for proper polypeptide folding and for
catalysis. The catalytic domain can be identified following a Smith-Waterman alignment 
of the protein sequence against the non-redundant protein database. The catalytic/kinase 
domains of the protein kinases of the invention are identified in Table 2, herein. Further, 
in some cases, the complete sequence of the catalytic/kinase domains of the protein 
kinases of the invention may not have been provided since the entire sequence is not 
available. However, with the methods described herein, the full-length sequences of the 
kinases of the invention can be determined, and using the approaches described herein, the 
catalytic/kinase domain can be identified. 
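
The Smith-Waterman alignment referred to above is a local alignment computed by dynamic programming. The following Python sketch shows the idea on a single pair of sequences. It is a minimal illustration under stated assumptions: the function name `smith_waterman` and the scoring values (match +2, mismatch -1, linear gap -2) are hypothetical choices made for this example, whereas a search against a protein database would ordinarily use a substitution matrix and affine gap penalties.

```python
def smith_waterman(query, subject, match=2, mismatch=-1, gap=-2):
    """Return (score, q_start, q_end, aligned_query, aligned_subject).

    query[q_start:q_end] is the locally aligned stretch of the query.
    Scoring values are illustrative assumptions, not taken from the text.
    """
    rows, cols = len(query) + 1, len(subject) + 1
    score = [[0] * cols for _ in range(rows)]
    best, best_pos = 0, (0, 0)

    def pair(i, j):
        return match if query[i - 1] == subject[j - 1] else mismatch

    # Fill the matrix; a local alignment never lets a cell drop below zero.
    for i in range(1, rows):
        for j in range(1, cols):
            score[i][j] = max(
                0,
                score[i - 1][j - 1] + pair(i, j),
                score[i - 1][j] + gap,
                score[i][j - 1] + gap,
            )
            if score[i][j] > best:
                best, best_pos = score[i][j], (i, j)

    # Trace back from the best cell until a zero cell is reached.
    i, j = best_pos
    q_end = i
    out_q, out_s = [], []
    while i > 0 and j > 0 and score[i][j] > 0:
        if score[i][j] == score[i - 1][j - 1] + pair(i, j):
            out_q.append(query[i - 1])
            out_s.append(subject[j - 1])
            i, j = i - 1, j - 1
        elif score[i][j] == score[i - 1][j] + gap:
            out_q.append(query[i - 1])
            out_s.append("-")
            i -= 1
        else:
            out_q.append("-")
            out_s.append(subject[j - 1])
            j -= 1
    return best, i, q_end, "".join(reversed(out_q)), "".join(reversed(out_s))
```

The returned `q_start` and `q_end` mark where the aligned region begins and ends in the query, which is the kind of boundary information used to delimit a catalytic domain from its flanking N-terminal and C-terminal regions.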

The term "catalytic activity", as used herein, defines the rate at which a kinase 
catalytic domain phosphorylates a substrate. Catalytic activity can be measured, for 
example, by determining the amount of a substrate converted to a phosphorylated product 
as a function of time. Catalytic activity can be measured by methods of the invention by 
holding time constant and determining the concentration of a phosphorylated substrate 
after a fixed period of time. Phosphorylation of a substrate occurs at the active-site of a 
protein kinase. The active-site is normally a cavity in which the substrate binds to the 
protein kinase and is phosphorylated. 
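
Both ways of measuring catalytic activity described above reduce to simple arithmetic, sketched below in Python. The function names and the units (micromolar product, minutes) are hypothetical choices for illustration, and the fixed-time calculation assumes the reaction is still in its linear phase at the time of the reading.

```python
def initial_rate(times_min, product_um):
    """Least-squares slope of product concentration versus time (uM per minute)."""
    n = len(times_min)
    mean_t = sum(times_min) / n
    mean_p = sum(product_um) / n
    num = sum((t - mean_t) * (p - mean_p) for t, p in zip(times_min, product_um))
    den = sum((t - mean_t) ** 2 for t in times_min)
    return num / den


def fixed_time_activity(product_um_at_end, assay_minutes):
    """Activity from one end-point reading taken after a fixed assay time.

    Assumption: product formation is linear over the whole assay period.
    """
    return product_um_at_end / assay_minutes
```

The first function corresponds to following phosphorylated product as a function of time; the second corresponds to holding time constant and reading the concentration of phosphorylated substrate once.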

The term "substrate" as used herein refers to a molecule phosphorylated by a
kinase of the invention. Kinases phosphorylate substrates on serine/threonine or tyrosine 
amino acids. The molecule may be another protein or a polypeptide. 

The term "C-terminal domain" refers to the region located between the catalytic 
domain and the carboxy-terminal amino acid residue of the protein kinase. The C- 
terminal domain can be identified by using a Smith-Waterman alignment of the protein 
sequence against the non-redundant protein database to define the C-terminal boundary of 
the catalytic domain or of any functional C-terminal extracatalytic domain. Depending on
its length and amino acid composition, the C-terminal domain may or may not play a
regulatory role in kinase function. An example of a protein kinase whose C-terminal
domain may play a regulatory role is PAK3 which contains a heterotrimeric Gβ
subunit-binding site near its C-terminus (Leeuw, T. et al. (1998) Nature 391, 191-195). The
C-terminal domain of a protein kinase of the invention is that portion of the protein kinase to
the carboxy-terminal side of the kinase domain where the kinase domain is identified in
Table 2, herein. In some cases, the C-terminal domains of the protein kinases of the
invention have not been provided since the entire sequence is not available. However,
with the methods described herein, the full-length sequences of the kinases of the
invention can be determined, and using the approaches described herein, the C-terminal
domain can be identified.

The term "signal transduction pathway" refers to the molecules that propagate an
extracellular signal through the cell membrane to become an intracellular signal. This
signal can then stimulate a cellular response. The polypeptide molecules involved in
signal transduction processes are typically receptor and non-receptor protein tyrosine
kinases, receptor and non-receptor protein phosphatases, SRC homology 2 and 3 domains,
phosphotyrosine binding proteins (SRC homology 2 (SH2) and phosphotyrosine binding
(PTB and PH) domain containing proteins), proline-rich binding proteins (SH3 domain
containing proteins), nucleotide exchange factors, and transcription factors.

The term "coiled-coil structure region" as used herein, refers to a polypeptide
sequence that has a high probability of adopting a coiled-coil structure as predicted by
computer algorithms such as COILS (Lupas, A. (1996) Meth. Enzymology 266:513-525).
Coiled-coils are formed by two or three amphipathic α-helices in parallel. Coiled-coils can
bind to coiled-coil domains of other polypeptides resulting in homo- or heterodimers
(Lupas, A. (1991) Science 252:1162-1164). Coiled-coil-dependent oligomerization has
been shown to be necessary for protein function including catalytic activity of
serine/threonine kinases (Roe, J. et al. (1997) J. Biol. Chem. 272:5838-5845). Coiled-coil
regions in the proteins of the invention can be identified using these methods. They may
be present as sub-domains of the N-terminal, kinase, or C-terminal domains of the
polypeptides of the invention.

The term "proline-rich region" as used herein, refers to a region of a protein kinase
whose proline content over a given amino acid length is higher than the average content of
this amino acid found in proteins (i.e., >10%). Proline-rich regions are easily discernible
by visual inspection of amino acid sequences and quantitated by standard computer
sequence analysis programs such as the DNAStar program EditSeq. Proline-rich regions
have been demonstrated to participate in regulatory protein-protein interactions. Among
these interactions, those that are most relevant to this invention involve the "PxxP"
proline-rich motif found in certain protein kinases (i.e., human PAK1) and the SH3 domain
of the adaptor molecule Nck (Galisteo, M.L. et al. (1996) J. Biol. Chem. 271:20997-21000).
Other regulatory interactions involving "PxxP" proline-rich motifs include the WW
domain (Sudol, M. (1996) Prog. Biophys. Mol. Bio. 65:113-132). Proline-rich regions in
the proteins of the invention can be identified using these methods. They may be present
as sub-domains of the N-terminal, kinase, or C-terminal domains of the polypeptides of
the invention. 
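
The proline-content criterion and the "PxxP" motif described above lend themselves to a simple computational screen, sketched here in Python. This is not the DNAStar EditSeq procedure; it is an independent illustration. The function names are hypothetical, and the default window of 30 residues is an assumption, since the definition above speaks only of "a given amino acid length".

```python
import re


def proline_rich_windows(seq, window=30, threshold=0.10):
    """Yield (start, end, fraction) for each window whose proline fraction exceeds threshold.

    The window length is an illustrative assumption; the 10% threshold follows the text.
    """
    seq = seq.upper()
    if not seq:
        return
    window = min(window, len(seq))
    for start in range(len(seq) - window + 1):
        chunk = seq[start:start + window]
        fraction = chunk.count("P") / window
        if fraction > threshold:
            yield start, start + window, fraction


def pxxp_motifs(seq):
    """Return 0-based start positions of every PxxP motif, including overlapping ones."""
    return [m.start() for m in re.finditer(r"(?=P..P)", seq.upper())]
```

The lookahead pattern in `pxxp_motifs` is used so that overlapping motifs, such as the two in `PAAPAAP`, are both reported.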

The term "spacer region" as used herein, refers to a region of the protein kinase
located between predicted functional domains. The spacer region has no detectable
homology to any amino acid sequence in the database, and can be identified by using a
Smith-Waterman alignment of the protein sequence against the non-redundant protein
database to define the C- and N-terminal boundaries of the flanking functional domains.
Spacer regions may or may not play a fundamental role in protein kinase function.
Precedent for the regulatory role of spacer regions in kinase function is provided by the
role of the src kinase spacer in inter-domain interactions (Xu, W. et al. (1997) Nature
385:595-602). Spacer regions in the proteins of the invention can be identified using these
methods. They may be present as sub-domains of the N-terminal, kinase, or C-terminal 
domains of the polypeptides of the invention. 

The term "insert" as used herein refers to a portion of a protein kinase that is 
absent from a close homolog. Inserts may or may not be the product of alternative splicing
of exons. Inserts can be identified by using a Smith-Waterman sequence alignment of the
protein sequence against the non-redundant protein database, or by means of a multiple 
sequence alignment of homologous sequences using the DNAStar program Megalign. 
Inserts may play a functional role by presenting a new interface for protein-protein 
interactions, or by interfering with such interactions. Insert regions in the proteins of the 
invention can be identified using these methods. They may be present as sub-domains of 
the N-terminal, kinase, or C-terminal domains of the polypeptides of the invention.

The term "C-terminal tail" as used herein, refers to a C-terminal domain of a
protein kinase, that by homology extends or protrudes past the C-terminal amino acid of
its closest homolog. C-terminal tails can be identified by using a Smith-Waterman
sequence alignment of the protein sequence against the non-redundant protein database, or
by means of a multiple sequence alignment of homologous sequences using the DNAStar
program Megalign. Depending on its length, a C-terminal tail may or may not play a
regulatory role in kinase function. C-terminal tail regions in the proteins of the invention
can be identified using these methods. They may be present as sub-domains of the
N-terminal, kinase, or C-terminal domains of the polypeptides of the invention.

Various low or high stringency hybridization conditions may be used depending
upon the specificity and selectivity desired. These conditions are well-known to those
skilled in the art. Under stringent hybridization conditions only highly complementary
nucleic acid sequences hybridize. Preferably, such conditions prevent hybridization of
nucleic acids having more than 1 or 2 mismatches out of 20 contiguous nucleotides, more
preferably, such conditions prevent hybridization of nucleic acids having more than 1 or 2
mismatches out of 50 contiguous nucleotides, most preferably, such conditions prevent
hybridization of nucleic acids having more than 1 or 2 mismatches out of 100 contiguous 
nucleotides. In some instances, the conditions may prevent hybridization of nucleic acids 
having more than 5 mismatches in the full-length sequence. 
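
The mismatch thresholds above (for example, no more than 1 or 2 mismatches in any 20 contiguous nucleotides) can be checked computationally for a pair of sequences. The Python sketch below is an illustration only and does not model hybridization chemistry. The function names are hypothetical, and it assumes the two sequences are already aligned without gaps, are of equal length, and are written in the same orientation (that is, the probe is compared with the strand it would be identical to in a perfect match).

```python
def max_mismatches_in_window(probe, target, window=20):
    """Largest number of mismatches found in any `window` contiguous aligned positions."""
    if len(probe) != len(target):
        raise ValueError("sequences must be pre-aligned to the same length")
    mismatches = [int(p != t) for p, t in zip(probe.upper(), target.upper())]
    if len(mismatches) <= window:
        return sum(mismatches)
    # Slide the window one position at a time, updating the running count.
    current = sum(mismatches[:window])
    worst = current
    for i in range(window, len(mismatches)):
        current += mismatches[i] - mismatches[i - window]
        worst = max(worst, current)
    return worst


def passes_stringency(probe, target, window=20, allowed=2):
    """True if no window of the given size contains more than `allowed` mismatches."""
    return max_mismatches_in_window(probe, target, window) <= allowed
```

Calling `passes_stringency` with `window=50` or `window=100` corresponds to the more preferred and most preferred thresholds recited above.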

By stringent hybridization assay conditions is meant hybridization assay conditions
at least as stringent as the following: hybridization in 50% formamide, 5X SSC, 50 mM
NaH2PO4, pH 6.8, 0.5% SDS, 0.1 mg/mL sonicated salmon sperm DNA, and 5X Denhardt's
solution at 42 °C overnight; washing with 2X SSC, 0.1% SDS at 45 °C; and washing with
0.2X SSC, 0.1% SDS at 45 °C. Under some of the most stringent hybridization assay
conditions, the second wash can be done with 0.1X SSC at a temperature up to 70 °C (pg.
421, Berger et al. (1987) Guide to Molecular Cloning Techniques, Meth. Enzym. vol. 152,
hereby incorporated by reference herein including any figures, tables, or drawings).
However, other applications may require the use of conditions falling between these sets
of conditions. Methods of determining the conditions required to achieve desired
hybridizations are well-known to those with ordinary skill in the art, and are based on
several factors, including but not limited to, the sequences to be hybridized and the
samples to be tested.

In other preferred embodiments, the invention features isolated, enriched, or
purified nucleic acid molecules encoding kinase polypeptides, further comprising a vector
or promoter effective to initiate transcription in a host cell. The invention also features
recombinant nucleic acid, preferably in a cell or an organism. The recombinant nucleic
acid may contain a sequence selected from the group consisting of those set forth in
SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5,
SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10,
SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15,
SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20,
SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25,
SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30,
SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35,
SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40,
SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45,
SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50,
SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55,
SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60,
SEQ ID NO:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65,
SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70,
SEQ ID NO:71, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75,
SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78, SEQ ID NO:79, SEQ ID NO:80,
SEQ ID NO:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84, SEQ ID NO:85,
SEQ ID NO:86, SEQ ID NO:87, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90,
SEQ ID NO:91, SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:95,
SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO:100,
SEQ ID NO:101, SEQ ID NO:102, SEQ ID NO:103, SEQ ID NO:104, SEQ ID NO:105,
SEQ ID NO:106, SEQ ID NO:107, SEQ ID NO:108, SEQ ID NO:109, SEQ ID NO:110,
SEQ ID NO:111, SEQ ID NO:112, SEQ ID NO:113, SEQ ID NO:114, SEQ ID NO:115,
SEQ ID NO:116, SEQ ID NO:117, SEQ ID NO:118, SEQ ID NO:119, SEQ ID NO:120,
and SEQ ID NO:121, or a functional
derivative thereof and a vector or a promoter effective to initiate transcription in a host
cell. The recombinant nucleic acid can alternatively contain a transcriptional initiation
region functional in a cell, a sequence complementary to an RNA sequence encoding a
kinase polypeptide and a transcriptional termination region functional in a cell. Specific
vectors and host cell combinations are discussed herein. The recombinant nucleic acid can
also contain the full-length sequence encoding the protein kinase, or a domain, for
example.

The term "vector" relates to a single or double-stranded circular nucleic acid
molecule that can be transfected into cells and replicated within or independently of a cell
genome. A circular double-stranded nucleic acid molecule can be cut and thereby
linearized upon treatment with restriction enzymes. An assortment of nucleic acid vectors,
restriction enzymes, and the knowledge of the nucleotide sequences cut by restriction
enzymes are readily available to those skilled in the art. A nucleic acid molecule encoding
a kinase can be inserted into a vector by cutting the vector with restriction enzymes and
ligating the two pieces together. 
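
The cutting and linearizing of a circular vector described above can be pictured with a small Python sketch. It is a simplified string model and not a cloning protocol. The function names are hypothetical, EcoRI (recognition sequence GAATTC, cut after the first G) is used purely as a familiar example, only the top strand is modeled, and a site spanning the arbitrary start of the circular sequence string is not searched for.

```python
def find_sites(seq, site="GAATTC"):
    """0-based start positions of every occurrence of `site` in `seq`."""
    seq, site = seq.upper(), site.upper()
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start)
        start = seq.find(site, start + 1)
    return positions


def linearize(circular, site="GAATTC", cut_offset=1):
    """Open a circular sequence at its single recognition site.

    cut_offset=1 places the cut after the first base of the site (G^AATTC),
    which is the assumed example enzyme's top-strand cut position.
    """
    positions = find_sites(circular, site)
    if len(positions) != 1:
        raise ValueError("expected exactly one site, found %d" % len(positions))
    cut = positions[0] + cut_offset
    return circular[cut:] + circular[:cut]


def ligate(linear_vector, insert):
    """Join a linearized vector and an insert; the result stands for a circular product."""
    return linear_vector + insert
```

When the insert has been cut with the same enzyme, so that it begins with `AATTC` and ends with `G`, the recognition site is regenerated at each junction of the ligated product in this model.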

The term "transfecting" defines a number of methods to insert a nucleic acid vector
or other nucleic acid molecules into a cellular organism. These methods involve a variety
of techniques, such as treating the cells with high concentrations of salt, an electric field,
detergent, or DMSO to render the outer membrane or wall of the cells permeable to
nucleic acid molecules of interest or use of various viral transduction strategies.

The term "promoter" as used herein, refers to nucleic acid sequence needed for
gene sequence expression. Promoter regions vary from organism to organism, but are well
known to persons skilled in the art for different organisms. For example, in prokaryotes,
the promoter region contains both the promoter (which directs the initiation of RNA
transcription) as well as the DNA sequences which, when transcribed into RNA, will
signal synthesis initiation. Such regions will normally include those 5'-non-coding
sequences involved with initiation of transcription and translation, such as the TATA box,
capping sequence, CAAT sequence, and the like.

In preferred embodiments, the isolated nucleic acid comprises, consists essentially
of, or consists of a nucleic acid sequence set forth in
SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5,
SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10,
SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15,
SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20,
SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25,
SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30,
SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35,
SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40,
SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45,
SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50,
SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55,
SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60,
SEQ ID NO:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65,
SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70,
SEQ ID NO:71, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75,
SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78, SEQ ID NO:79, SEQ ID NO:80,
SEQ ID NO:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84, SEQ ID NO:85,
SEQ ID NO:86, SEQ ID NO:87, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90,
SEQ ID NO:91, SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:95,
SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO:100,
SEQ ID NO:101, SEQ ID NO:102, SEQ ID NO:103, SEQ ID NO:104, SEQ ID NO:105,
SEQ ID NO:106, SEQ ID NO:107, SEQ ID NO:108, SEQ ID NO:109, SEQ ID NO:110,
SEQ ID NO:111, SEQ ID NO:112, SEQ ID NO:113, SEQ ID NO:114, SEQ ID NO:115,
SEQ ID NO:116, SEQ ID NO:117, SEQ ID NO:118, SEQ ID NO:119, SEQ ID NO:120,
and SEQ ID NO:121, or the corresponding full-length sequence, encodes
an amino acid sequence selected from the group consisting of those set forth in
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:192, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, or the corresponding full-length amino acid sequence, a functional
derivative thereof, or at least 10, 20, 40, 50, 75, 100, 200, 300 or 500 contiguous amino
acids of a
sequence selected from the group consisting of those set forth in
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:192, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, or the
corresponding full-length sequences or derivatives thereof. The nucleic acid may be 
isolated from a natural source by cDNA cloning or by subtractive hybridization. The 
natural source may be mammalian, preferably human, blood, semen, or tissue, and the 
nucleic acid may be synthesized by the triester method or by using an automated DNA 
synthesizer. 

The term "mammal" refers preferably to such organisms as mice, rats, rabbits, 
guinea pigs, sheep, and goats, more preferably to cats, dogs, monkeys, and apes, and most 
preferably to humans. 

In yet other preferred embodiments, the nucleic acid is a conserved or unique 
region, for example those useful for: the design of hybridization probes to facilitate 
identification and cloning of additional polypeptides, the design of PCR probes to 
facilitate cloning of additional polypeptides, obtaining antibodies to polypeptide regions, 
and designing antisense oligonucleotides. 

By "conserved nucleic acid regions" are meant regions present on two or more
nucleic acids encoding a kinase polypeptide, to which a particular nucleic acid sequence
can hybridize under lower stringency conditions. Examples of lower-stringency
conditions suitable for screening for nucleic acid encoding kinase polypeptides are
provided in Berger et al. (1987) Guide to Molecular Cloning Techniques, Meth. Enzym.
vol. 152, hereby incorporated by reference herein in its entirety, including any drawings, 
figures, or tables. Preferably, conserved regions differ by no more than 5 out of 20 
nucleotides, even more preferably 2 out of 20 nucleotides or most preferably 1 out of 20 
nucleotides. 

By "unique nucleic acid region" is meant a sequence present in a nucleic acid
coding for a kinase polypeptide that is not present in a sequence coding for any other
naturally occurring polypeptide. Such regions preferably encode 10 (preferably 25, more
preferably 50, most preferably 75) or more contiguous amino acids selected from the
group consisting of those set forth in
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:192, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, or functional derivatives thereof.
In particular, a unique nucleic acid region is preferably of mammalian origin and 
preferably human. 

A second aspect of the invention features a nucleic acid probe for the detection of 
nucleic acid encoding a kinase polypeptide in a sample, wherein said polypeptide is 
selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139,
SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242. Preferably, the nucleic acid
probe encodes a kinase polypeptide that is a fragment of the protein encoded by an amino
acid sequence selected from the group consisting of those set forth in
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:192, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, or the corresponding full-length amino acid sequences. The nucleic
acid probe
contains a nucleotide base sequence that will hybridize to a sequence selected from the
group consisting of those set forth in
SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5,
SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10,
SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15,
SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20,
SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25,
SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30,
SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35,
SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40,
SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45,
SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50,
SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55,
SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60,
SEQ ID NO:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65,
SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70,
SEQ ID NO:71, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75,
SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78, SEQ ID NO:79, SEQ ID NO:80,
SEQ ID NO:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84, SEQ ID NO:85,
SEQ ID NO:86, SEQ ID NO:87, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90,
SEQ ID NO:91, SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:95,
SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO:100,
SEQ ID NO:101, SEQ ID NO:102, SEQ ID NO:103, SEQ ID NO:104, SEQ ID NO:105,
SEQ ID NO:106, SEQ ID NO:107, SEQ ID NO:108, SEQ ID NO:109, SEQ ID NO:110,
SEQ ID NO:111, SEQ ID NO:112, SEQ ID NO:113, SEQ ID NO:114, SEQ ID NO:115,
SEQ ID NO:116, SEQ ID NO:117, SEQ ID NO:118, SEQ ID NO:119, SEQ ID NO:120,
and SEQ ID NO:121, or the corresponding full-length sequence, or a functional
derivative thereof.

In preferred embodiments, the nucleic acid probe hybridizes to nucleic acid
encoding at least 6, 12, 75, 90, 105, 120, 150, 200, 250, 300 or 350 contiguous amino
acids of a sequence selected from the group consisting of those set forth in
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, or the corresponding full-length amino acid sequence, or functional
derivatives thereof.

Methods for using the probes include detecting the presence or amount of kinase
RNA in a sample by contacting the sample with a nucleic acid probe under conditions
such that hybridization occurs and detecting the presence or amount of the probe bound to
kinase RNA. The nucleic acid duplex formed between the probe and a nucleic acid
sequence coding for a kinase polypeptide may be used in the identification of the sequence
of the nucleic acid detected (Nelson et al., in Nonisotopic DNA Probe Techniques,
Academic Press, San Diego, Kricka, ed., p. 275, 1992, hereby incorporated by reference
herein in its entirety, including any drawings, figures, or tables). Kits for performing such
methods may be constructed to include a container means having disposed therein a
nucleic acid probe.

In a third aspect, the invention describes a recombinant cell or tissue comprising a
nucleic acid molecule encoding a kinase polypeptide selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242. In such cells, the nucleic acid may be under the control of the
genomic regulatory elements, or may be under the control of exogenous regulatory
elements including an exogenous promoter. By "exogenous" it is meant a promoter that is
not normally coupled in vivo transcriptionally to the coding sequence for the kinase
polypeptides.

The polypeptide is preferably a fragment of the protein encoded by an amino acid
sequence selected from the group consisting of those set forth in
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, or the corresponding full-length amino acid sequence. By
"fragment," is meant an amino acid sequence present in a kinase polypeptide.
Preferably, such a sequence comprises at least 10, 20, 40, 50, 75, 100, 200, or 300
contiguous amino acids of a sequence selected from the group consisting of those set
forth in
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, or of the corresponding full-length amino acid sequence, or a
functional derivative thereof.
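
As an illustration of the fragment definition above, the following minimal Python sketch
checks whether a candidate peptide occurs as a contiguous run of at least a given number
of residues within a reference sequence. The function name `is_contiguous_fragment`, the
`min_length` parameter, and the toy sequences are hypothetical and chosen for
illustration only; they do not correspond to any SEQ ID NO of the invention.

```python
def is_contiguous_fragment(candidate: str, reference: str, min_length: int = 10) -> bool:
    """Return True if `candidate` is a contiguous stretch of `reference`
    that is at least `min_length` residues long."""
    candidate = candidate.strip().upper()
    reference = reference.strip().upper()
    if len(candidate) < min_length:
        return False
    # Substring membership is exactly the "contiguous amino acids" test.
    return candidate in reference


# Toy one-letter amino acid sequences (assumed, not sequences of the invention).
reference = "MGSNKSKPKDASQRRRSLEPAENVHGAGGGAFPASQ"
fragment = reference[5:20]  # 15 contiguous residues of the reference

print(is_contiguous_fragment(fragment, reference))       # threshold of 10 residues
print(is_contiguous_fragment(fragment, reference, 20))   # threshold of 20 residues
```

The thresholds passed as `min_length` would correspond to the lengths recited above
(10, 20, 40, 50, 75, 100, 200, or 300 residues).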

In a fourth aspect, the invention features an isolated, enriched, or purified kinase
polypeptide selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242.

By "isolated" in reference to a polypeptide is meant a polymer of amino acids (2 or
more amino acids) conjugated to each other, including polypeptides that are isolated from 
a natural source or that are synthesized. The isolated polypeptides of the present invention 
are unique in the sense that they are not found in a pure or separated state in nature. Use 
of the term "isolated" indicates that a naturally occurring sequence has been removed from 
its normal cellular environment. Thus, the sequence may be in a cell-free solution or 
placed in a different cellular environment. The term does not imply that the sequence is 
the only amino acid chain present, but that it is essentially free (about 90 - 95% pure at 
least) of non-amino acid material naturally associated with it. 

By the use of the term "enriched" in reference to a polypeptide is meant that the 
specific amino acid sequence constitutes a significantly higher fraction (2 - 5 fold) of the 
total amino acid sequences present in the cells or solution of interest than in normal or 
diseased cells or in the cells from which the sequence was taken. This could be caused by 
a person by preferential reduction in the amount of other amino acid sequences present, or 
by a preferential increase in the amount of the specific amino acid sequence of interest, or 
by a combination of the two. However, it should be noted that enriched does not imply 
that there are no other amino acid sequences present, just that the relative amount of the 
sequence of interest has been significantly increased. The term significant here is used to 
indicate that the level of increase is useful to the person making such an increase, and 
generally means an increase relative to other amino acid sequences of about at least 2-fold, 
more preferably at least 5- to 10-fold or even more. The term also does not imply that 
there is no amino acid sequence from other sources. The other source of amino acid 
sequences may, for example, comprise amino acid sequence encoded by a yeast or 
bacterial genome, or a cloning vector such as pUC19. The term is meant to cover only
those situations in which man has intervened to increase the proportion of the desired 
amino acid sequence. 
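
The enrichment criterion above can be expressed numerically. The minimal Python sketch
below computes fold enrichment as the ratio of the sequence's fraction in the enriched
preparation to its fraction in the source material, and compares the result with the
approximately 2-fold level mentioned above. The function name, its arguments, and the
example amounts are assumptions made for illustration.

```python
def fold_enrichment(target_after, total_after, target_before, total_before):
    """Ratio of the target's fraction after enrichment to its fraction before."""
    if min(total_after, total_before, target_before) <= 0:
        raise ValueError("totals and starting target amount must be positive")
    fraction_after = target_after / total_after
    fraction_before = target_before / total_before
    return fraction_after / fraction_before


# Hypothetical amounts (arbitrary mass units): the target sequence makes up
# 1% of the source material and 5% of the enriched preparation.
fold = fold_enrichment(target_after=5.0, total_after=100.0,
                       target_before=1.0, total_before=100.0)
print(f"{fold:.1f}-fold enriched; at least 2-fold: {fold >= 2}")
```

The same ratio applies whether the change comes from reducing other sequences,
increasing the sequence of interest, or both.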

It is also advantageous for some purposes that an amino acid sequence be in
purified form. The term "purified" in reference to a polypeptide does not require absolute
purity (such as a homogeneous preparation); instead, it represents an indication that the
sequence is relatively purer than in the natural environment. Compared to the natural level,
this level should be at least 2-5 fold greater (e.g., in terms of mg/mL). Purification of at
least one order of magnitude, preferably two or three orders, and more preferably four or 
five orders of magnitude is expressly contemplated. The substance is preferably free of 
contamination at a functionally significant level, for example 90%, 95%, or 99% pure. 

In preferred embodiments, the kinase polypeptide is a fragment of the protein
encoded by an amino acid sequence selected from the group consisting of those set forth in
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, or the corresponding full-length amino acid sequences. Preferably,
the kinase polypeptide contains at least 10, 20, 40, 50, 75, 100, 200, or 300 contiguous
amino acids of a sequence selected from the group consisting of those set forth in
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, or the corresponding full-length amino acid sequence, or a functional
derivative thereof.

In preferred embodiments, the kinase polypeptide comprises an amino acid
sequence having (a) an amino acid sequence selected from the group consisting of those
set forth in
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242; (b) an amino acid sequence selected from the group
consisting of those set forth in
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, except that it lacks one or more, but not all of a domain selected
from the group consisting of an N-terminal domain, a catalytic domain, a C-terminal
domain, a coiled-coil structure region, a proline-rich region, a spacer region, an insert,
and a C-terminal tail; (c) an amino acid sequence of a domain of
a polypeptide selected from the group consisting of those set forth in
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242 where the domain is selected from the group consisting of an
N-terminal domain, a catalytic domain, a C-terminal domain, a coiled-coil structure
region, a proline-rich region, a spacer region, an insert, and a C-terminal tail; or
(d) an amino acid sequence
selected from the group consisting of those set forth in
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, except that it lacks one or more, but not all, of the domains
selected from the group consisting of a C-terminal domain, a catalytic domain, an
N-terminal domain, a spacer region, a proline-rich region, a coiled-coil structure
region, an insert, and a C-terminal tail. (The domain demarcations of the polypeptides
of the invention are indicated in Table 2 by reference to the kinase domain.)

The polypeptide can be isolated from a natural source by methods well-known in
the art. The natural source may be mammalian, preferably human, blood, semen, or tissue,
and the polypeptide may be synthesized using an automated polypeptide synthesizer. The
isolated, enriched, or purified kinase polypeptide is preferably selected from the group
consisting of those set forth in
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242.

In some embodiments the invention includes a recombinant kinase polypeptide
selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242. By "recombinant kinase polypeptide" is meant a polypeptide
produced by recombinant DNA techniques such that it is distinct from a naturally
occurring polypeptide either in its location (e.g., present in a different cell or
tissue than found in nature), purity or structure. Generally, such a recombinant
polypeptide will be present in a cell in an amount different from that normally
observed in nature.

In a fifth aspect, the invention features an antibody (e.g., a monoclonal or
polyclonal antibody) having specific binding affinity to a kinase polypeptide or a kinase
polypeptide domain or fragment where the polypeptide is selected from the group
consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242. In preferred embodiments, the antibody binds specifically to
domains of kinase polypeptides that are defined supra.

By "specific binding affinity" is meant that the antibody binds to the target kinase
polypeptide with greater affinity than it binds to other polypeptides under specified
conditions. Antibodies or antibody fragments are polypeptides that contain regions that
can bind other polypeptides. The term "specific binding affinity" describes an antibody
that binds to a kinase polypeptide with greater affinity than it binds to other polypeptides
under specified conditions.

The term "polyclonal" refers to antibodies that are heterogeneous populations of
antibody molecules derived from the sera of animals immunized with an antigen or an
antigenic functional derivative thereof. For the production of polyclonal antibodies,
various host animals may be immunized by injection with the antigen. Various adjuvants
may be used to increase the immunological response, depending on the host species.

"Monoclonal antibodies" are substantially homogeneous populations of antibodies
to a particular antigen. They may be obtained by any technique which provides for the
production of antibody molecules by continuous cell lines in culture. Monoclonal
antibodies may be obtained by methods known to those skilled in the art (Kohler et al.,
Nature 256:495-497, 1975, and U.S. Patent No. 4,376,110, both of which are hereby
incorporated by reference herein in their entirety including any figures, tables, or
drawings).

The term "antibody fragment" refers to a portion of an antibody, often the
hypervariable region and portions of the surrounding heavy and light chains, that displays
specific binding affinity for a particular molecule. A hypervariable region is a portion of
an antibody that physically binds to the polypeptide target.

Antibodies or antibody fragments having specific binding affinity to a kinase
polypeptide or domains of a kinase polypeptide of the invention may be used in methods
for detecting the presence and/or amount of kinase polypeptide in a sample by probing the
sample with the antibody under conditions suitable for kinase-antibody immunocomplex
formation and detecting the presence and/or amount of the antibody conjugated to the
kinase polypeptide. Diagnostic kits for performing such methods may be constructed to
include antibodies or antibody fragments specific for the kinase as well as a conjugate of a
binding partner of the antibodies or the antibodies themselves.

An antibody or antibody fragment with specific binding affinity to a kinase
polypeptide of the invention can be isolated, enriched, or purified from a prokaryotic or
eukaryotic organism. Routine methods known to those skilled in the art enable production
of antibodies or antibody fragments, in both prokaryotic and eukaryotic organisms.
Purification, enrichment, and isolation of antibodies, which are polypeptide molecules, are
described above.

Antibodies having specific binding affinity to a kinase polypeptide of the invention
may be used in methods for detecting the presence and/or amount of kinase polypeptide in
a sample by contacting the sample with the antibody under conditions such that an
immunocomplex forms and detecting the presence and/or amount of the antibody
conjugated to the kinase polypeptide. Diagnostic kits for performing such methods may
be constructed to include a first container containing the antibody and a second container
having a conjugate of a binding partner of the antibody and a label, such as, for example, a
radioisotope. The diagnostic kit may also include notification of an FDA approved use
and instructions therefor.

In a sixth aspect, the invention features a hybridoma which produces an antibody
having specific binding affinity to a kinase polypeptide or a kinase polypeptide domain,
where the polypeptide is selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242; and where the domains are defined as above. By "hybridoma" is
meant an immortalized cell line that is capable of secreting an antibody, for example an
antibody to a kinase of the invention. In preferred embodiments, the antibody to the
kinase comprises a sequence of amino acids that is able to specifically bind a kinase
polypeptide of the invention.

In a seventh aspect, the invention features a kinase polypeptide binding agent able
to bind to a kinase polypeptide selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242. The binding agent is preferably a purified antibody that
recognizes an epitope present on a kinase polypeptide of the invention. Other binding
agents include molecules that bind to kinase polypeptides and analogous molecules that
bind to a kinase polypeptide. Such binding agents may be identified by using assays that
measure kinase binding partner activity, such as those that measure PDGFR activity.

The invention also features a method for screening for human cells containing a
kinase polypeptide of the invention or an equivalent sequence. The method involves
identifying the novel polypeptide in human cells using techniques that are routine and
standard in the art, such as those described herein for identifying the kinases of the
invention (e.g., cloning, Southern or Northern blot analysis, in situ hybridization, PCR
amplification, etc.).

In an eighth aspect, the invention features methods for identifying a substance that
modulates kinase activity comprising the steps of: (a) contacting a kinase polypeptide
selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242 with a test substance; (b) measuring the activity of said
polypeptide; and (c) determining whether said substance modulates the activity of said
polypeptide.

The term "modulates" refers to the ability of a compound to alter the function of a 
kinase of the invention. A modulator preferably activates or inhibits the activity of a 
kinase of the invention. 

The term "activates" refers to increasing the cellular activity of the kinase. The
term "inhibits" refers to decreasing the cellular activity of the kinase. Kinase activity is
preferably the interaction with a natural binding partner.
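
As an illustration only, steps (b) and (c) of the screening method, read together with the definitions of "activates" and "inhibits", can be sketched in Python. The specification does not prescribe any data analysis, so everything here is an assumption made for the sketch: the function name `classify_modulation`, the use of replicate activity readings in arbitrary units, and the hypothetical fold-change cutoff `min_fold_change`.

```python
from statistics import mean


def classify_modulation(control, treated, min_fold_change=1.5):
    """Label a test substance from replicate kinase-activity readings.

    control / treated: activity values (arbitrary units) measured without
    and with the test substance. min_fold_change is a hypothetical cutoff
    chosen for this sketch; it is not taken from the specification.
    """
    if not control or not treated:
        raise ValueError("need at least one reading per condition")
    baseline = mean(control)
    if baseline <= 0:
        raise ValueError("control activity must be positive")
    # Ratio of mean treated activity to mean untreated activity.
    ratio = mean(treated) / baseline
    if ratio >= min_fold_change:
        return "activates", ratio
    if ratio <= 1 / min_fold_change:
        return "inhibits", ratio
    return "no clear modulation", ratio


label, ratio = classify_modulation([100.0, 104.0, 96.0], [40.0, 44.0, 36.0])
print(label, round(ratio, 2))  # prints "inhibits 0.4"
```

A real assay would normally add a statistical test and a dose series; the single cutoff is used here only to keep the sketch short.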

The term "modulates" also refers to altering the function of kinases of the
invention by increasing or decreasing the probability that a complex forms between the
kinase and a natural binding partner. A modulator preferably increases the probability that
such a complex forms between the kinase and the natural binding partner, more preferably
increases or decreases the probability that a complex forms between the kinase and the
natural binding partner depending on the concentration of the compound exposed to the
kinase, and most preferably decreases the probability that a complex forms between the
kinase and the natural binding partner.

The term "complex" refers to an assembly of at least two molecules bound to one
another. Signal transduction complexes often contain at least two protein molecules
bound to one another. For instance, a protein tyrosine receptor protein kinase, GRB2,
SOS, RAF, and RAS assemble to form a signal transduction complex in response to a
mitogenic ligand.

The term "natural binding partner" refers to polypeptides, lipids, small molecules,
or nucleic acids that bind to kinases in cells. A change in the interaction between a kinase
and a natural binding partner can manifest itself as an increased or decreased probability
that the interaction forms, or an increased or decreased concentration of kinase/natural
binding partner complex.

The term "contacting" as used herein refers to mixing a solution comprising the
test compound with a liquid medium bathing the cells of the methods. The solution
comprising the compound may also comprise another component, such as dimethyl
sulfoxide (DMSO), which facilitates the uptake of the test compound or compounds into
the cells of the methods. The solution comprising the test compound may be added to the
medium bathing the cells by utilizing a delivery apparatus, such as a pipet-based device or
syringe-based device.

In a ninth aspect, the invention features methods for identifying a substance that
modulates kinase activity in a cell comprising the steps of: (a) expressing a kinase
polypeptide in a cell, wherein said polypeptide is selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242; (b) adding a test substance to said cell; and (c) monitoring a change
in cell phenotype or the interaction between said polypeptide and a natural binding
partner.

The term "expressing" as used herein refers to the production of kinases of the 
invention from a nucleic acid vector containing kinase genes within a cell. The nucleic 
acid vector is transfected into cells using techniques well known in the art, as described
herein.

In a tenth aspect, the invention provides methods for treating a disease or abnormal
condition by administering to a patient in need of such treatment a substance that
modulates the activity of a polypeptide selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242. Preferably, the disease is selected from the group consisting of
immune-related diseases and disorders, cardiovascular disease, neurodegenerative
disorders, and cancer. Also included are metabolic disorders, such as diabetes mellitus,
and reproductive disorders, such as infertility.

Preferably, the disease or disorder is selected from the group consisting of
rheumatoid arthritis, atherosclerosis, autoimmune disorders, and organ transplantation.
Preferably, the disease or disorder is selected from the group consisting of immune-related
diseases and disorders, myocardial infarction, cardiomyopathies, stroke, renal failure, and
oxidative stress-related neurodegenerative disorders. Most preferably, the immune-related
diseases and disorders are selected from the group consisting of rheumatoid arthritis,
chronic inflammatory bowel disease, chronic inflammatory pelvic disease, multiple
sclerosis, asthma, osteoarthritis, psoriasis, atherosclerosis, rhinitis, autoimmunity, and
organ transplantation.

Substances useful for treatment of disorders or diseases preferably show positive
results in one or more in vitro assays for an activity corresponding to treatment of the
disease or disorder in question. Substances that modulate the activity of the polypeptides
preferably include, but are not limited to, antisense oligonucleotides and inhibitors of
protein kinases.

The term "preventing" refers to decreasing the probability that an organism
contracts or develops an abnormal condition.

The term "treating" refers to having a therapeutic effect and at least partially 
alleviating or abrogating an abnormal condition in the organism. 

The term "therapeutic effect" refers to the inhibition or activation of factors causing
or contributing to the abnormal condition. A therapeutic effect relieves to some extent one
or more of the symptoms of the abnormal condition. In reference to the treatment of 
abnormal conditions, a therapeutic effect can refer to one or more of the following: (a) an 
increase in the proliferation, growth, and/or differentiation of cells; (b) inhibition (i.e., 
slowing or stopping) of cell death; (c) inhibition of degeneration; (d) relieving to some 
extent one or more of the symptoms associated with the abnormal condition; and (e) 
enhancing the function of the affected population of cells. Compounds demonstrating 
efficacy against abnormal conditions can be identified as described herein. 

The term "abnormal condition" refers to a function in the cells or tissues of an 
organism that deviates from their normal functions in that organism. An abnormal 
condition can relate to cell proliferation, cell differentiation or cell survival. An abnormal 
condition may also include irregularities in cell cycle progression, i.e., irregularities in 
normal cell cycle progression through mitosis and meiosis. 

Abnormal cell proliferative conditions include cancers, fibrotic and
mesangial disorders, abnormal angiogenesis and vasculogenesis, wound healing, psoriasis,
diabetes mellitus, and inflammation.

Abnormal differentiation conditions include, but are not limited to,
neurodegenerative disorders, slow wound healing rates, and slow tissue grafting healing
rates.

Abnormal cell survival conditions relate to conditions in which programmed cell 
death (apoptosis) pathways are activated or abrogated. A number of protein kinases are 
associated with the apoptosis pathways. Aberrations in the function of any one of the 
protein kinases could lead to cell immortality or premature cell death.

The term "aberration", in conjunction with the function of a kinase in a signal
transduction process, refers to a kinase that is over- or under-expressed in an organism,
mutated such that its catalytic activity is lower or higher than wild-type protein kinase
activity, mutated such that it can no longer interact with a natural binding partner, is no
longer modified by another protein kinase or protein phosphatase, or no longer interacts
with a natural binding partner.

The term "administering" relates to a method of incorporating a compound into
cells or tissues of an organism. The abnormal condition can be prevented or treated when 
the cells or tissues of the organism exist within the organism or outside of the organism. 
Cells existing outside the organism can be maintained or grown in cell culture dishes. For 
cells harbored within the organism, many techniques exist in the art to administer 
compounds, including (but not limited to) oral, parenteral, dermal, injection, and aerosol 
applications. For cells outside of the organism, multiple techniques exist in the art to 
administer the compounds, including (but not limited to) cell microinjection techniques, 
transformation techniques, and carrier techniques. 

The abnormal condition can also be prevented or treated by administering a 
compound to a group of cells having an aberration in a signal transduction pathway to an 
organism. The effect of administering a compound on organism function can then be 
monitored. The organism is preferably a mouse, rat, rabbit, guinea pig, or goat, more 
preferably a monkey or ape, and most preferably a human. 

In an eleventh aspect, the invention features methods for detecting the expression
of a polypeptide in a sample as a diagnostic tool for diseases or disorders, wherein the
method comprises the steps of: (a) contacting the sample with a nucleic acid probe which
hybridizes under hybridization assay conditions to a nucleic acid target region of a kinase
polypeptide selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127,
SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133,
SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139,
SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145,
SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157,
SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163,
SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169,
SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175,
SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187,
SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193,
SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199,
SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205,
SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217,
SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223,
SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229,
SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235,
SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, said probe
comprising the nucleic acid sequence encoding the polypeptide, fragments thereof, and the
complements of the sequences and fragments; and (b) detecting the presence or amount of
the probe:target region hybrid as an indication of the disease.

In preferred embodiments of the invention, the disease or disorder is selected from 
the group consisting of rheumatoid arthritis, atherosclerosis, autoimmune disorders, organ
transplantation, myocardial infarction, cardiomyopathies, stroke, renal failure, oxidative
stress-related neurodegenerative disorders, metabolic disorders including diabetes,
reproductive disorders including infertility, and cancer. 

The kinase "target region" is a nucleotide base sequence selected from the group 
consisting of those set forth in
SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6,
SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12,
SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18,
SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24,
SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30,
SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36,
SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42,
SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48,
SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54,
SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60,
SEQ ID NO:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66,
SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID NO:72,
SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78,
SEQ ID NO:79, SEQ ID NO:80, SEQ ID NO:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84,
SEQ ID NO:85, SEQ ID NO:86, SEQ ID NO:87, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90,
SEQ ID NO:91, SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96,
SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:101, SEQ ID NO:102,
SEQ ID NO:103, SEQ ID NO:104, SEQ ID NO:105, SEQ ID NO:106, SEQ ID NO:107, SEQ ID NO:108,
SEQ ID NO:109, SEQ ID NO:110, SEQ ID NO:111, SEQ ID NO:112, SEQ ID NO:113, SEQ ID NO:114,
SEQ ID NO:115, SEQ ID NO:116, SEQ ID NO:117, SEQ ID NO:118, SEQ ID NO:119, SEQ ID NO:120,
and SEQ ID NO:121, or the corresponding full-length sequences, a functional
derivative thereof, or a fragment thereof to which the nucleic acid probe will specifically
hybridize. Specific hybridization indicates that in the presence of other nucleic acids the
probe only hybridizes detectably with the kinase of the invention's target region. Putative
target regions can be identified by methods well known in the art consisting of alignment 
and comparison of the most closely related sequences in the database. 

In preferred embodiments the nucleic acid probe hybridizes to a kinase target 
region encoding at least 6, 12, 75, 90, 105, 120, 150, 200, 250, 300 or 350 contiguous 
amino acids of the sequence set forth in
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127,
SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133,
SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139,
SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145,
SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157,
SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163,
SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169,
SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175,
SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187,
SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193,
SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199,
SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205,
SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217,
SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223,
SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229,
SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235,
SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, or the corresponding
full-length amino acid sequence, or a functional derivative thereof. Hybridization
conditions should be such that hybridization occurs only with the kinase genes in the
presence of other nucleic acid molecules. Under stringent hybridization conditions only
highly complementary nucleic acid sequences hybridize. Preferably, such conditions
prevent hybridization of nucleic acids having more than 1 or 2 mismatches out of 20
contiguous nucleotides. Such conditions are defined supra.

Hybridization conditions should be such that hybridization occurs only with the 
genes in the presence of other nucleic acid molecules. Under stringent hybridization 
conditions only highly complementary nucleic acid sequences hybridize. Preferably, such 
conditions prevent hybridization of nucleic acids having 1 or 2 mismatches out of 20 
contiguous nucleotides. Such conditions are defined supra. 
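
As an illustration only, the mismatch criterion stated above can be written as a
sliding-window check. The Python sketch below is not part of the disclosed method. The
function names, the default window of 20 nucleotides and the default limit of 2
mismatches are assumptions chosen to mirror the wording above, and the sketch assumes
that the probe and target have already been aligned and written in the same orientation.
Actual hybridization depends on the assay conditions defined supra, which this sketch
does not model.

```python
def max_mismatches_in_window(probe: str, target: str, window: int = 20) -> int:
    """Return the largest mismatch count found in any run of `window` contiguous positions.

    Assumption: `probe` and `target` are pre-aligned, equal-length strings written
    in the same orientation, so position i of one is compared with position i of the other.
    """
    if len(probe) != len(target):
        raise ValueError("probe and target must be aligned to equal length")
    mismatches = [p != t for p, t in zip(probe.upper(), target.upper())]
    if len(mismatches) <= window:
        return sum(mismatches)
    return max(
        sum(mismatches[i:i + window])
        for i in range(len(mismatches) - window + 1)
    )


def passes_stringency(probe: str, target: str,
                      allowed_mismatches: int = 2, window: int = 20) -> bool:
    """True if no window of contiguous positions exceeds the allowed mismatch count."""
    return max_mismatches_in_window(probe, target, window) <= allowed_mismatches


if __name__ == "__main__":
    probe = "ACGT" * 5                    # hypothetical 20-nucleotide probe
    one_mismatch = "T" + probe[1:]        # differs from the probe at one position
    three_mismatches = "TTT" + probe[3:]  # differs at three positions
    print(passes_stringency(probe, one_mismatch))
    print(passes_stringency(probe, three_mismatches))
```

The limit is left as a parameter because the text gives "1 or 2" rather than a single
value.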

The diseases for which detection of kinase genes in a sample could be diagnostic 
include diseases in which kinase nucleic acid (DNA and/or RNA) is amplified in 
comparison to normal cells. By "amplification" is meant increased numbers of kinase
DNA or RNA in a cell compared with normal cells. In normal cells, kinases are typically
found as single copy genes. In selected diseases, the chromosomal location of the kinase
genes may be amplified, resulting in multiple copies of the gene, or amplification. Gene
amplification can lead to amplification of kinase RNA, or kinase RNA can be amplified in
the absence of kinase DNA amplification.

"Amplification" as it refers to RNA can be the detectable presence of kinase RNA 
in cells, since in some normal cells there is no basal expression of kinase RNA. In other 
normal cells, a basal level of expression of kinase exists, therefore in these cases 
amplification is the detection of at least 1-2-fold, and preferably more, kinase RNA,
compared to the basal level.
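
The two cases described above (no basal expression versus a measurable basal level) can
be sketched as a simple decision rule. This Python fragment is illustrative only: the
function name, the arbitrary expression units and the default threshold of 2.0 are
assumptions, since the text specifies only "at least 1-2-fold, and preferably more".

```python
def is_rna_amplified(sample_level: float, basal_level: float,
                     min_fold: float = 2.0) -> bool:
    """Apply the two-case reading of RNA "amplification" described in the text.

    sample_level and basal_level are kinase RNA measurements in the same
    arbitrary units; min_fold is an assumed threshold, not a value fixed by the text.
    """
    if sample_level < 0 or basal_level < 0:
        raise ValueError("expression levels cannot be negative")
    if basal_level == 0:
        # No basal expression in the normal cells: any detectable RNA counts.
        return sample_level > 0
    # Basal expression exists: require a fold increase over the basal level.
    return sample_level / basal_level >= min_fold


if __name__ == "__main__":
    print(is_rna_amplified(5.0, 0.0))    # detectable RNA, no basal expression
    print(is_rna_amplified(30.0, 10.0))  # 3-fold over basal
    print(is_rna_amplified(12.0, 10.0))  # 1.2-fold over basal
```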

The diseases that could be diagnosed by detection of kinase nucleic acid in a 
sample preferably include cancers. The test samples suitable for nucleic acid probing 
methods of the present invention include, for example, cells or nucleic acid extracts of 
cells, or biological fluids. The samples used in the above-described methods will vary
based on the assay format, the detection method and the nature of the tissues, cells or
extracts to be assayed. Methods for preparing nucleic acid extracts of cells are well
known in the art and can be readily adapted in order to obtain a sample that is compatible
with the method utilized.

Another aspect of the invention involves a method of agonizing (stimulating) or
antagonizing a target of the invention and a natural binding partner associated activity in a
mammal comprising administering to said mammal an agonist or antagonist to one of the
above disclosed polypeptides in an amount sufficient to effect said agonism or
antagonism. A method of treating diseases in a mammal with an agonist or antagonist of
the activity of the protein of the present invention, comprising administering the agonist or
antagonist to a mammal in an amount sufficient to agonize or antagonize associated
functions, is also encompassed in the present application.

In an effort to discover novel treatments for diseases, biomedical researchers and 
chemists have designed, synthesized, and tested molecules that inhibit the function of 
protein polypeptides. Some small organic molecules form a class of compounds that
modulate the function of protein polypeptides. Examples of molecules that have been
reported to inhibit the function of protein kinases include, but are not limited to, bis
monocyclic, bicyclic or heterocyclic aryl compounds (PCT WO 92/20642, published
November 26, 1992 by Maguire et al.), vinylene-azaindole derivatives (PCT WO
94/14808, published July 7, 1994 by Ballinari et al.), 1-cyclopropyl-4-pyridyl-quinolones
(U.S. Patent No. 5,330,992), styryl compounds (U.S. Patent No. 5,217,999),
styryl-substituted pyridyl compounds (U.S. Patent No. 5,302,606), certain quinazoline
derivatives (EP Application No. 0 566 266 A1), seleoindoles and selenides (PCT WO
94/03427, published February 17, 1994 by Denny et al.), tricyclic polyhydroxylic
compounds (PCT WO 92/21660, published December 10, 1992 by Dow), and
benzylphosphonic acid compounds (PCT WO 91/15495, published October 17, 1991 by
Dow et al.), all of which are incorporated by reference herein, including any drawings.

Compounds that can traverse cell membranes and are resistant to acid hydrolysis 
are potentially advantageous as therapeutics as they can become highly bioavailable after 
being administered orally to patients. However, many of these protein inhibitors only 
weakly inhibit function. In addition, many inhibit a variety of protein kinases and will
therefore cause multiple side-effects as therapeutics for diseases. 

Some indolinone compounds, however, form classes of acid resistant and 
membrane permeable organic molecules. WO 96/22976 (published August 1, 1996 by 
Ballinari et al.) describes hydrosoluble indolinone compounds that harbor tetralin, 
naphthalene, quinoline, and indole substituents fused to the oxindole ring. These bicyclic
substituents are in turn substituted with polar groups including hydroxylated alkyl,
phosphate, and ether substituents. U.S. Patent Application Serial Nos. 08/702,232, filed
August 23, 1996, entitled "Indolinone Combinatorial Libraries and Related Products and
Methods for the Treatment of Disease" by Tang et al. (Lyon & Lyon Docket No.
221/187) and 08/485,323, filed June 7, 1995, entitled "Benzylidene-Z-Indoline
Compounds for the Treatment of Disease" by Tang et al. (Lyon & Lyon Docket No.
223/298) and International Patent Publication WO 96/22976, published August 1, 1996 by
Ballinari et al., all of which are incorporated herein by reference in their entirety,
including any drawings, describe indolinone chemical libraries of indolinone compounds
harboring other bicyclic moieties as well as monocyclic moieties fused to the oxindole
ring. Applications 08/702,232, filed August 23, 1996, entitled "Indolinone Combinatorial
Libraries and Related Products and Methods for the Treatment of Disease" by Tang et al.
(Lyon & Lyon Docket No. 221/187), 08/485,323, filed June 7, 1995, entitled
"Benzylidene-Z-Indoline Compounds for the Treatment of Disease" by Tang et al. (Lyon
& Lyon Docket No. 223/298), and WO 96/22976, published August 1, 1996 by Ballinari
et al. teach methods of indolinone synthesis, methods of testing the biological activity of
indolinone compounds in cells, and inhibition patterns of indolinone derivatives, all of
which are incorporated by reference herein, including any drawings.

Other examples of substances capable of modulating kinase activity include, but
are not limited to, tyrphostins, quinazolines, quinoxalines, and quinolines. The
quinazolines, tyrphostins, quinolines, and quinoxalines referred to above include well
known compounds such as those described in the literature. For example, representative
publications describing quinazolines include Barker et al., EPO Publication No. 0 520 722
A1; Jones et al., U.S. Patent No. 4,447,608; Kabbe et al., U.S. Patent No. 4,757,072; Kaul
and Vougioukas, U.S. Patent No. 5,316,553; Kreighbaum and Comer, U.S. Patent No.
4,343,940; Pegg and Wardleworth, EPO Publication No. 0 562 734 A1; Barker et al., Proc.
of Am. Assoc. for Cancer Research 32:327 (1991); Bertino, J.R., Cancer Research
3:293-304 (1979); Bertino, J.R., Cancer Research 9(2 part 1):293-304 (1979); Curtin et al.,
Br. J. Cancer 53:361-368 (1986); Fernandes et al., Cancer Research 43:1117-1123 (1983);
Ferris et al., J. Org. Chem. 44(2):173-178; Fry et al., Science 265:1093-1095 (1994);
Jackman et al., Cancer Research 51:5579-5586 (1981); Jones et al., J. Med. Chem.
29(6):1114-1118; Lee and Skibo, Biochemistry 26(23):7355-7362 (1987); Lemus et al.,
J. Org. Chem. 54:3511-3518 (1989); Ley and Seng, Synthesis 1975:415-522 (1975);
Maxwell et al., Magnetic Resonance in Medicine 17:189-196 (1991); Mini et al., Cancer
Research 45:325-330 (1985); Phillips and Castle, J. Heterocyclic Chem. 17(19):1489-1596
(1980); Reece et al., Cancer Research 47(11):2996-2999 (1977); Sculier et al., Cancer
Immunol. and Immunother. 23:A65 (1986); Sikora et al., Cancer Letters 23:289-295
(1984); and Sikora et al., Analytical Biochem. 172:344-355 (1988), all of which are
incorporated herein by reference in their entirety, including any drawings.

Quinoxaline is described in Kaul and Vougioukas, U.S. Patent No. 5,316,553,
incorporated herein by reference in its entirety, including any drawings.

Quinolines are described in Dolle et al., J. Med. Chem. 37:2627-2629 (1994);
Maguire, J. Med. Chem. 37:2129-2131 (1994); Burke et al., J. Med. Chem. 36:425-432
(1993); and Burke et al., BioOrganic Med. Chem. Letters 2:1771-1774 (1992), all of which
are incorporated by reference in their entirety, including any drawings.

Tyrphostins are described in Allen et al., Clin. Exp. Immunol. 91:141-156 (1993);
Anafi et al., Blood 82:12:3524-3529 (1993); Baker et al., J. Cell Sci. 102:543-555 (1992);
Bilder et al., Amer. Physiol. Soc. pp. 6363-6143:C721-C730 (1991); Brunton et al.,
Proceedings of Amer. Assoc. Cancer Rsch. 33:558 (1992); Bryckaert et al., Experimental
Cell Research 199:255-261 (1992); Dong et al., J. Leukocyte Biology 53:53-60 (1993);
Dong et al., J. Immunol. 151(5):2717-2724 (1993); Gazit et al., J. Med. Chem.
32:2344-2352 (1989); Gazit et al., J. Med. Chem. 36:3556-3564 (1993); Kaur et al.,
Anti-Cancer Drugs 5:213-222 (1994); Kaur et al., King et al., Biochem. J. 275:413-418
(1991); Kuo et al., Cancer Letters 74:197-202 (1993); Levitzki, A., The FASEB J.
6:3275-3282 (1992); Lyall et al., J. Biol. Chem. 264:14503-14509 (1989); Peterson et al.,
The Prostate 22:335-345 (1993); Pillemer et al., Int. J. Cancer 50:80-85 (1992); Posner
et al., Molecular Pharmacology 45:673-683 (1993); Rendu et al., Biol. Pharmacology
44(5):881-888 (1992); Sauro and Thomas, Life Sciences 53:371-376 (1993); Sauro and
Thomas, J. Pharm. and Experimental Therapeutics 267(3):119-1125 (1993); Wolbring et
al., J. Biol. Chem. 269(36):22470-22472 (1994); and Yoneda et al., Cancer Research
51:4430-4435 (1991); all of which are incorporated herein by reference in their entirety,
including any drawings.

Other compounds that could be used as modulators include oxindolinones such as 
those described in U.S. patent application Serial No. 08/702,232 filed August 23, 1996, 
incorporated herein by reference in its entirety, including any drawings. 

Methods of Treating a Disease (Enablement - i.e., Dosing)

Methods of determining the dosages of compounds to be administered to a patient 
and modes of administering compounds to an organism are disclosed in U.S. Application 
Serial No. 08/702,282, filed August 23, 1996 and International patent publication number 
WO 96/22976, published August 1, 1996, both of which are incorporated herein by
reference in their entirety, including any drawings, figures or tables. Those skilled in the 
art will appreciate that such descriptions are applicable to the present invention and can be 
easily adapted to it. 

The proper dosage depends on various factors such as the type of disease being 
treated, the particular composition being used and the size and physiological condition of 
the patient. Therapeutically effective doses for the compounds described herein can be 
estimated initially from cell culture and animal models. For example, a dose can be
formulated in animal models to achieve a circulating concentration range that initially
takes into account the IC50 as determined in cell culture assays. The animal model data
can be used to more accurately determine useful doses in humans.

Plasma half-life and biodistribution of the drug and metabolites in the plasma,
tumors and major organs can also be determined to facilitate the selection of drugs most
appropriate to inhibit a disorder. Such measurements can be carried out. For example,
HPLC analysis can be performed on the plasma of animals treated with the drug and the
location of radiolabeled compounds can be determined using detection methods such as
X-ray, CAT scan and MRI. Compounds that show potent inhibitory activity in the 
screening assays, but have poor pharmacokinetic characteristics, can be optimized by 
altering the chemical structure and retesting. In this regard, compounds displaying good 
pharmacokinetic characteristics can be used as a model. 

Toxicity studies can also be carried out by measuring the blood cell composition. 
For example, toxicity studies can be carried out in a suitable animal model as follows: 
1) the compound is administered to mice (an untreated control mouse should also be 
used); 2) blood samples are periodically obtained via the tail vein from one mouse in each 
treatment group; and 3) the samples are analyzed for red and white blood cell counts, 
blood cell composition and the percent of lymphocytes versus polymorphonuclear cells. A 
comparison of results for each dosing regime with the controls indicates if toxicity is 
present. 

At the termination of each toxicity study, further studies can be carried out by 
sacrificing the animals (preferably, in accordance with the American Veterinary Medical
Association guidelines, Report of the American Veterinary Medical Assoc. Panel on
Euthanasia, Journal of American Veterinary Medical Assoc., 202:229-249, 1993).
Representative animals from each treatment group can then be examined by gross 
necropsy for immediate evidence of metastasis, unusual illness or toxicity. Gross 
abnormalities in tissue are noted and tissues are examined histologically. Compounds 
causing a reduction in body weight or blood components are less preferred, as are 
compounds having an adverse effect on major organs. In general, the greater the adverse
effect the less preferred the compound.

For the treatment of cancers the expected daily dose of a hydrophobic
pharmaceutical agent is between 1 to 500 mg/day, preferably 1 to 250 mg/day, and most
preferably 1 to 50 mg/day. Drugs can be delivered less frequently provided plasma levels 
of the active moiety are sufficient to maintain therapeutic effectiveness. 

Plasma levels should reflect the potency of the drug. Generally, the more potent 
the compound the lower the plasma levels necessary to achieve efficacy. 

In a final aspect, the invention features a method for detection of a kinase 
polypeptide in a sample as a diagnostic tool for a disease or disorder, wherein the method 
comprises: (a) comparing a nucleic acid target region encoding the kinase polypeptide in 
a sample, where the kinase polypeptide is selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127,
SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133,
SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139,
SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145,
SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157,
SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163,
SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169,
SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175,
SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187,
SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193,
SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199,
SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205,
SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217,
SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223,
SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229,
SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235,
SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, or one or more fragments thereof, with a control nucleic acid target
region encoding the kinase polypeptide, or one or more fragments thereof; and (b) detecting
differences in sequence or amount between the target region and the control target region,
as an indication of the disease or disorder. Preferably, the disease or disorder is selected
from the group consisting of immune-related diseases and disorders, organ transplantation,
myocardial infarction, cardiovascular disease, stroke, renal failure, oxidative stress-related
neurodegenerative disorders, and cancer. Immune-related diseases and disorders include,
but are not limited to, those discussed previously.

The term "comparing" as used herein refers to identifying discrepancies between 
the nucleic acid target region isolated from a sample, and the control nucleic acid target 
region. The discrepancies can be in the nucleotide sequences, e.g. insertions, deletions, or 
point mutations, or in the amount of a given nucleotide sequence. Methods to determine 
these discrepancies in sequences are well-known to one of ordinary skill in the art. The
"control" nucleic acid target region refers to the sequence or amount of the sequence
found in normal cells, e.g. cells that are not diseased as discussed previously. 
The term also includes anti-sense molecules drawn thereto. 
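
The sequence side of "comparing" as defined above (insertions, deletions and point
mutations relative to a control) can be illustrated with a short Python sketch. It is one
possible approach under stated assumptions, not the method of the invention: it uses the
general-purpose matcher in Python's standard difflib module rather than a dedicated
sequence-alignment tool, and the function name and example sequences are hypothetical.

```python
from difflib import SequenceMatcher


def compare_target_regions(control: str, sample: str):
    """List sequence discrepancies between a control and a sample target region.

    Each result is (kind, position_in_control, control_bases, sample_bases),
    where kind is "substitution", "deletion" or "insertion".
    """
    kinds = {"replace": "substitution", "delete": "deletion", "insert": "insertion"}
    # autojunk=False turns off difflib's heuristic that treats very frequent
    # elements as junk, which is unsuitable for a four-letter alphabet.
    matcher = SequenceMatcher(None, control, sample, autojunk=False)
    discrepancies = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag != "equal":
            discrepancies.append((kinds[tag], i1, control[i1:i2], sample[j1:j2]))
    return discrepancies


if __name__ == "__main__":
    control = "ATGGCCAAGT"  # hypothetical control target region
    print(compare_target_regions(control, "ATGGCTAAGT"))  # single-base change
    print(compare_target_regions(control, "ATGGAAGT"))    # two bases missing
```

Differences in the amount of a sequence, the other kind of discrepancy named above, would
need quantitative measurements and are outside this sketch.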

The invention has been described broadly and generically herein. Each of the
narrower species and subgeneric groupings falling within the generic disclosure also forms
part of the invention. This includes the generic description of the invention with a proviso
or negative limitation removing any subject matter from the genus, regardless of whether
or not the excised material is specifically recited herein. For example, in some instances
the nucleotide sequence of particular kinase polypeptides may not be part of a preferred
embodiment.

The summary of the invention described above is not limiting and other features 
and advantages of the invention will be apparent from the following detailed description of 
the invention, and from the claims.

BRIEF DESCRIPTION OF THE FIGURES
Figures 1A to 1BB show the amino acid sequences of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127,
SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133,
SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139,
SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145,
SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157,
SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163,
SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169,
SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175,
SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187,
SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193,
SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199,
SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205,
SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217,
SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223,
SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229,
SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235,
SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242.

Figures 2A to 2MMMM show the nucleic acid sequences of SEQ ID NO:1, SEQ
ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7,
SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID
NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18,
SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ
ID NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID
NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34,
SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40,
SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46,
SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52,
SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58,
SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64,
SEQ ID NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70,
SEQ ID NO:71, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76,
SEQ ID NO:77, SEQ ID NO:78, SEQ ID NO:79, SEQ ID NO:80, SEQ ID NO:81, SEQ ID NO:82,
SEQ ID NO:83, SEQ ID NO:84, SEQ ID NO:85, SEQ ID NO:86, SEQ ID NO:87, SEQ ID NO:88,
SEQ ID NO:89, SEQ ID NO:90, SEQ ID NO:91, SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:94,
SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO:100,
SEQ ID NO:101, SEQ ID NO:102, SEQ ID NO:103, SEQ ID NO:104, SEQ ID NO:105, SEQ ID NO:106,
SEQ ID NO:107, SEQ ID NO:108, SEQ ID NO:109, SEQ ID NO:110, SEQ ID NO:111, SEQ ID NO:112,
SEQ ID NO:113, SEQ ID NO:114, SEQ ID NO:115, SEQ ID NO:116, SEQ ID NO:117, SEQ ID NO:118,
SEQ ID NO:119, SEQ ID NO:120, and SEQ ID NO:121.

DETAILED DESCRIPTION OF THE INVENTION 
The present invention relates in part to kinase polypeptides, nucleic acids encoding 
such polypeptides, cells containing such nucleic acids, antibodies to such polypeptides, 
assays utilizing such polypeptides, and methods relating to all of the foregoing. The 
present invention is based upon the isolation and characterization of new kinase 
polypeptides. The polypeptides and nucleic acids may be produced using well-known and 
standard synthesis techniques when given the sequences presented herein. 

I. The Nucleic Acids of the Invention 

Included within the scope of this invention are the functional equivalents of the 
herein-described isolated nucleic acid molecules. The degeneracy of the genetic code 
permits substitution of certain codons by other codons that specify the same amino acid 
and hence would give rise to the same protein. The nucleic acid sequence can vary
substantially since, with the exception of methionine and tryptophan, the known amino
acids can be coded for by more than one codon. Thus, portions or all of the kinase genes
of the invention could be synthesized to give a nucleic acid sequence significantly
different from one selected from the group consisting of those set forth in
SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6,
SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12,
SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18,
SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24,
SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30,
SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36,
SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42,
SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48,
SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54,
SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60,
SEQ ID NO:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66,
SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID NO:72,
SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78,
SEQ ID NO:79, SEQ ID NO:80, SEQ ID NO:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84,
SEQ ID NO:85, SEQ ID NO:86, SEQ ID NO:87, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90,
SEQ ID NO:91, SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96,
SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:101, SEQ ID NO:102,
SEQ ID NO:103, SEQ ID NO:104, SEQ ID NO:105, SEQ ID NO:106, SEQ ID NO:107, SEQ ID NO:108,
SEQ ID NO:109, SEQ ID NO:110, SEQ ID NO:111, SEQ ID NO:112, SEQ ID NO:113, SEQ ID NO:114,
SEQ ID NO:115, SEQ ID NO:116, SEQ ID NO:117, SEQ ID NO:118, SEQ ID NO:119, SEQ ID NO:120,
and SEQ ID NO:121. The encoded amino
acid sequence thereof would, however, be preserved. 
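
The effect of codon degeneracy can be illustrated with a short sketch. The code below is only an illustration under stated assumptions: the 15-nucleotide input is a hypothetical fragment (not one of the sequences of the invention), the function names are invented for this example, and the table is the standard genetic code written in the conventional TCAG order. It swaps every codon for a synonymous one where a synonym exists and confirms that the encoded amino acid sequence is unchanged.

```python
# Standard genetic code for DNA codons, in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def synonymous_codons(codon):
    """Return all codons, sorted, that specify the same amino acid as `codon`."""
    amino_acid = CODON_TABLE[codon]
    return sorted(c for c, a in CODON_TABLE.items() if a == amino_acid)


def recode(dna):
    """Replace each codon with a different synonymous codon where one exists.

    Methionine (ATG) and tryptophan (TGG) have no synonyms and are kept.
    """
    usable = len(dna) - len(dna) % 3
    recoded = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        alternatives = [c for c in synonymous_codons(codon) if c != codon]
        recoded.append(alternatives[0] if alternatives else codon)
    return "".join(recoded)


# Hypothetical fragment encoding Met-Ala-Trp-Lys-Leu.
original = "ATGGCTTGGAAACTG"
variant = recode(original)
print(original, translate(original))
print(variant, translate(variant))
```

In this sketch only the methionine and tryptophan codons survive unchanged, which mirrors the exception noted above; every other codon is replaced while the translated sequence stays the same.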

In addition, the nucleic acid sequence may comprise a nucleotide sequence which
results from the addition, deletion or substitution of at least one nucleotide to the 5'-end
and/or the 3'-end of the nucleic acid sequence shown in SEQ ID NO:1, SEQ ID NO:2,
SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8,
SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14,
SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20,
SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26,
SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32,
SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38,
SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44,
SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50,
SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID NO:56,
SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:61, SEQ ID NO:62,
SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68,
SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74,
SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78, SEQ ID NO:79, SEQ ID NO:80,
SEQ ID NO:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84, SEQ ID NO:85, SEQ ID NO:86,
SEQ ID NO:87, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ID NO:91, SEQ ID NO:92,
SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98,
SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:101, SEQ ID NO:102, SEQ ID NO:103, SEQ ID NO:104,
SEQ ID NO:105, SEQ ID NO:106, SEQ ID NO:107, SEQ ID NO:108, SEQ ID NO:109, SEQ ID NO:110,
SEQ ID NO:111, SEQ ID NO:112, SEQ ID NO:113, SEQ ID NO:114, SEQ ID NO:115, SEQ ID NO:116,
SEQ ID NO:117, SEQ ID NO:118, SEQ ID NO:119, SEQ ID NO:120,
and SEQ ID NO:121, or a derivative thereof. Any nucleotide
or polynucleotide may be used in this regard, provided that its addition, deletion or
substitution does not alter the amino acid sequence of SEQ ID NO:122, SEQ ID NO:123,
SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129,
SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135,
SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147,
SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153,
SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159,
SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165,
SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177,
SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID NO:183,
SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189,
SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:192, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195,
SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207,
SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID NO:213,
SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219,
SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225,
SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237,
SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, that is encoded
by the nucleotide sequence. For example, the present invention is intended to include any
nucleic acid sequence resulting from the addition of ATG as an initiation codon at the
5'-end of the inventive nucleic acid sequence or its derivative, or from the addition of TAA,
TAG or TGA as a termination codon at the 3'-end of the inventive nucleotide sequence or
its derivative. Moreover, the nucleic acid molecule of the present invention may, as
necessary, have restriction endonuclease recognition sites added to its 5'-end and/or
3'-end.
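
By way of a hedged illustration, terminal additions of this kind can be sketched in a few lines. Everything named here is an assumption made for the example: the function `flank` is hypothetical, the 12-nucleotide fragment is invented, and the EcoRI (GAATTC) and BamHI (GGATCC) recognition sequences are simply two familiar sites chosen as defaults. The sketch prepends ATG when the fragment lacks it, appends one of the standard termination codons (TAA, TAG or TGA) when none is present, and then adds a recognition site at each end, leaving the original coding nucleotides untouched.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def flank(coding, stop="TAA", site_5="GAATTC", site_3="GGATCC"):
    """Add start/stop codons and flanking restriction sites to a coding fragment.

    `coding` is assumed to begin at a codon boundary. The default sites are the
    EcoRI and BamHI recognition sequences, used here purely as examples.
    """
    if stop not in STOP_CODONS:
        raise ValueError("stop must be one of TAA, TAG or TGA")
    orf = coding.upper()
    if not orf.startswith("ATG"):
        orf = "ATG" + orf          # initiation codon at the 5'-end
    if orf[-3:] not in STOP_CODONS:
        orf = orf + stop           # termination codon at the 3'-end
    return site_5 + orf + site_3


# Hypothetical fragment encoding Ala-Trp-Lys-Leu.
fragment = "GCTTGGAAACTG"
construct = flank(fragment)
print(construct)
```

Because the additions are confined to the ends, the fragment still appears intact inside the construct and the amino acid sequence it encodes is not altered.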

Such functional alterations of a given nucleic acid sequence afford an opportunity 
to promote secretion and/or processing of heterologous proteins encoded by foreign 
nucleic acid sequences fused thereto, for example. All variations of the nucleotide 
sequence of the kinase genes of the invention and fragments thereof permitted by the 
genetic code are, therefore, included in this invention. 

Further, it is possible to delete codons or to substitute one or more codons with 
codons other than degenerate codons to produce a structurally modified polypeptide, but 
one which has substantially the same utility or activity as the polypeptide produced by the 
unmodified nucleic acid molecule. As recognized in the art, the two polypeptides are 
functionally equivalent, as are the two nucleic acid molecules that give rise to their
production, even though the differences between the nucleic acid molecules are not related
to the degeneracy of the genetic code. This is discussed further in the "Functional
Derivatives" section, herein.

Finally, many of the nucleic acid molecules of the invention are provided as a 
partial sequence only (Fig. 2A through 2QQ). However, it is standard for one of ordinary 
skill in the art to obtain a full-length sequence when provided with a partial sequence. 
Similarly, when provided with a partial or full-length sequence it is standard for one of 
ordinary skill in the art to obtain nucleic acid sequence coding for homologous proteins. 
Therefore, these nucleic acid molecules are also part of the invention. 

The characteristics of the protein kinase nucleic acid sequences of the invention are 
provided in Table 1. The protein kinases fall into 10 known groups: AGC, CAMK, CKI, 
CMGC, dsPK, EEFK, LEMK, MLK, STE and TK. In addition, there are a significant 
number of protein kinases that do not belong to any of the known groups, and therefore 
presumably define new protein kinase groups. 

Additional characteristics may be found, inter alia, in the tables, namely Table 1,
Table 2, Table 3 and Table 4, shown below.

II. Nucleic Acid Probes, Methods, and Kits for Detection of Protein Kinases. 

A nucleic acid probe of the present invention may be used to probe an appropriate 
chromosomal or cDNA library by usual hybridization methods to obtain other nucleic acid 
molecules of the present invention. A chromosomal DNA or cDNA library may be 
prepared from appropriate cells according to recognized methods in the art (cf. "Molecular 
Cloning: A Laboratory Manual", second edition, Cold Spring Harbor Laboratory, 
Sambrook, Fritsch, & Maniatis, eds., 1989).

In the alternative, chemical synthesis can be carried out in order to obtain nucleic 
acid probes having nucleotide sequences that correspond to N-terminal, kinase or C- 
terminal portions, for example, of the amino acid sequence of the polypeptide of interest. 
The synthesized nucleic acid probes may be used as primers in a polymerase chain
reaction (PCR) carried out in accordance with recognized PCR techniques, essentially
according to "PCR Protocols: A Guide to Methods and Applications", Academic Press,
Michael et al., eds., 1990, utilizing the appropriate chromosomal or cDNA library to
obtain the fragment of the present invention.

One skilled in the art can readily design such probes based on the sequence
disclosed herein using methods of computer alignment and sequence analysis known in
the art ("Molecular Cloning: A Laboratory Manual", 1989, supra). The hybridization
probes of the present invention can be labeled by standard labeling techniques such as 
with a radiolabel, enzyme label, fluorescent label, biotin-avidin label, chemiluminescence, 
and the like. After hybridization, the probes may be visualized using known methods. 
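
As a rough illustration of computer-assisted probe design, the following sketch enumerates candidate antisense oligonucleotide probes along a target sequence and keeps those whose estimated melting temperature falls in a chosen window. It is a sketch under stated assumptions rather than a description of any particular method used for the invention: the function names, the 18-nucleotide probe length, the 50-60 degree window and the target fragment are all hypothetical, and the melting temperature uses the simple Wallace rule (2 degrees C per A or T, 4 degrees C per G or C), which is only a rough estimate for short oligonucleotides.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def wallace_tm(oligo):
    """Estimate melting temperature (degrees C) with the Wallace rule."""
    oligo = oligo.upper()
    at = oligo.count("A") + oligo.count("T")
    gc = oligo.count("G") + oligo.count("C")
    return 2 * at + 4 * gc


def candidate_probes(target, length=18, tm_range=(50, 60)):
    """List (start, probe, tm) for antisense probes within the Tm window.

    Each probe is the reverse complement of a window of `target`, so it
    would hybridize to the strand given as `target`.
    """
    target = target.upper()
    low, high = tm_range
    probes = []
    for start in range(len(target) - length + 1):
        probe = reverse_complement(target[start:start + length])
        tm = wallace_tm(probe)
        if low <= tm <= high:
            probes.append((start, probe, tm))
    return probes


# Hypothetical target fragment, not a sequence of the invention.
target = "ATGGCTTGGAAACTGACCGGTTAAGC"
for start, probe, tm in candidate_probes(target):
    print(start, probe, tm)
```

In practice a designer would also screen candidates for self-complementarity and for cross-hybridization to unrelated sequences; those checks are omitted here for brevity.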

The nucleic acid probes of the present invention include RNA, as well as DNA 
probes, such probes being generated using techniques known in the art. The nucleic acid 
probe may be immobilized on a solid support. Examples of such solid supports include, 
but are not limited to, plastics such as polycarbonate, complex carbohydrates such as 
agarose and sepharose, and acrylic resins, such as polyacrylamide and latex beads. 
Techniques for coupling nucleic acid probes to such solid supports are well known in the 
art. 

The test samples suitable for nucleic acid probing methods of the present invention 
include, for example, cells or nucleic acid extracts of cells, or biological fluids. The 
samples used in the above-described methods will vary based on the assay format, the 
detection method and the nature of the tissues, cells or extracts to be assayed. Methods for 
preparing nucleic acid extracts of cells are well known in the art and can be readily 
adapted in order to obtain a sample that is compatible with the method utilized. 

One method of detecting the presence of nucleic acids of the invention in a sample 
comprises (a) contacting said sample with the above-described nucleic acid probe under 
conditions such that hybridization occurs, and (b) detecting the presence of said probe 
bound to said nucleic acid molecule. One skilled in the art would select the nucleic acid 
probe according to techniques known in the art as described above. Samples to be tested 
include but should not be limited to RNA samples of human tissue. 

A kit for detecting the presence of nucleic acids of the invention in a sample 
comprises at least one container means having disposed therein the above-described 
nucleic acid probe. The kit may further comprise other containers comprising one or more
of the following: wash reagents and reagents capable of detecting the presence of bound
nucleic acid probe. Examples of detection reagents include, but are not limited to,
radiolabeled probes, enzymatic labeled probes (horseradish peroxidase, alkaline
phosphatase), and affinity labeled probes (biotin, avidin, or streptavidin).

In detail, a compartmentalized kit includes any kit in which reagents are contained 
in separate containers. Such containers include small glass containers, plastic containers 
or strips of plastic or paper. Such containers allow the efficient transfer of reagents from 
one compartment to another compartment such that the samples and reagents are not 
cross-contaminated and the agents or solutions of each container can be added in a 
quantitative fashion from one compartment to another. Such containers will include a 
container which will accept the test sample, a container which contains the probe or 
primers used in the assay, containers which contain wash reagents (such as phosphate 
buffered saline, Tris-buffers, and the like), and containers which contain the reagents used 
to detect the hybridized probe, bound antibody, amplified product, or the like. One skilled 
in the art will readily recognize that the nucleic acid probes described in the present 
invention can readily be incorporated into one of the established kit formats that are well 
known in the art. 

III. DNA Constructs Comprising a Protein Kinase Nucleic Acid Molecule and Cells
Containing These Constructs. 

The present invention also relates to a recombinant DNA molecule comprising, 5'
to 3', a promoter effective to initiate transcription in a host cell and the above-described
nucleic acid molecules. In addition, the present invention relates to a recombinant DNA 
molecule comprising a vector and an above-described nucleic acid molecule. The present 
invention also relates to a nucleic acid molecule comprising a transcriptional region 
functional in a cell, a sequence complementary to an RNA sequence encoding an amino 
acid sequence corresponding to the above-described polypeptide, and a transcriptional 
termination region functional in said cell. The above-described molecules may be isolated 
and/or purified DNA molecules. 

The present invention also relates to a cell or organism that contains an above- 
described nucleic acid molecule and thereby is capable of expressing a polypeptide. The 
polypeptide may be purified from cells that have been altered to express the polypeptide. 
A cell is said to be "altered to express a desired polypeptide" when the cell, through 
genetic manipulation, is made to produce a protein which it normally does not produce or 
which the cell normally produces at lower levels. One skilled in the art can readily adapt
procedures for introducing and expressing either genomic, cDNA, or synthetic sequences 
into either eukaryotic or prokaryotic cells. 

A nucleic acid molecule, such as DNA, is said to be "capable of expressing" a 
polypeptide if it contains nucleotide sequences which contain transcriptional and 
translational regulatory information and such sequences are "operably linked" to 
nucleotide sequences which encode the polypeptide. An operable linkage is a linkage in 
which the regulatory DNA sequences and the DNA sequence sought to be expressed are 
connected in such a way as to permit gene sequence expression. The precise nature of the 
regulatory regions needed for gene sequence expression may vary from organism to 
organism, but shall in general include a promoter region which, in prokaryotes, contains 
both the promoter (which directs the initiation of RNA transcription) as well as the DNA 
sequences which, when transcribed into RNA, will signal synthesis initiation. Such
regions will normally include those 5'-non-coding sequences involved with initiation of
transcription and translation, such as the TATA box, capping sequence, CAAT sequence, 
and the like. 

If desired, the non-coding region 3' to the sequence encoding a kinase of the 
invention may be obtained by the above-described methods. This region may be retained 
for its transcriptional termination regulatory sequences, such as termination and
polyadenylation. Thus, by retaining the 3'-region naturally contiguous to the DNA
sequence encoding a kinase of the invention, the transcriptional termination signals may
be provided. Where the transcriptional termination signals are not satisfactorily functional
in the expression host cell, then a 3' region functional in the host cell may be substituted.

Two DNA sequences (such as a promoter region sequence and a sequence 
encoding a kinase of the invention) are said to be operably linked if the nature of the 
linkage between the two DNA sequences does not (1) result in the introduction of a frame- 
shift mutation, (2) interfere with the ability of the promoter region sequence to direct the 
transcription of a gene sequence encoding a kinase of the invention, or (3) interfere with 
the ability of the gene sequence of a kinase of the invention to be transcribed by the 
promoter region sequence. Thus, a promoter region would be operably linked to a DNA
sequence if the promoter were capable of effecting transcription of that DNA sequence.
Thus, to express a gene encoding a kinase of the invention, transcriptional and
translational signals recognized by an appropriate host are necessary.

The present invention encompasses the expression of a gene encoding a kinase of 
the invention (or a functional derivative thereof) in either prokaryotic or eukaryotic cells.
Prokaryotic hosts are, generally, very efficient and convenient for the production of 
recombinant proteins and are, therefore, one type of preferred expression system for 
kinases of the invention. Prokaryotes most frequently are represented by various strains of 
E. coli. However, other microbial strains may also be used, including other bacterial 
strains. 

In prokaryotic systems, plasmid vectors that contain replication sites and control 
sequences derived from a species compatible with the host may be used. Examples of 
suitable plasmid vectors may include pBR322, pUC118, pUC119 and the like; suitable
phage or bacteriophage vectors may include λgt10, λgt11 and the like; and suitable virus
vectors may include pMAM-neo, pKRC and the like. Preferably, the selected vector of the
present invention has the capacity to replicate in the selected host cell. 

Recognized prokaryotic hosts include bacteria such as E. coli, Bacillus, 
Streptomyces, Pseudomonas, Salmonella, Serratia, and the like. However, under such 
conditions, the polypeptide will not be glycosylated. The prokaryotic host must be 
compatible with the replicon and control sequences in the expression plasmid. 

To express a kinase of the invention (or a functional derivative thereof) in a 
prokaryotic cell, it is necessary to operably link the sequence encoding the kinase of the 
invention to a functional prokaryotic promoter. Such promoters may be either constitutive 
or, more preferably, regulatable (i.e., inducible or derepressible). Examples of constitutive
promoters include the int promoter of bacteriophage λ, the bla promoter of the β-lactamase
gene sequence of pBR322, and the cat promoter of the chloramphenicol acetyl
transferase gene sequence of pPR325, and the like. Examples of inducible prokaryotic
promoters include the major right and left promoters of bacteriophage λ (PL and PR), the
trp, recA, lacZ, lacI, and gal promoters of E. coli, the α-amylase (Ulmanen et al., J.
Bacteriol. 162:176-182, 1985) and the σ-28-specific promoters of B. subtilis (Gilman et
al., Gene Sequence 32:11-20, 1984), the promoters of the bacteriophages of Bacillus
(Gryczan, In: The Molecular Biology of the Bacilli, Academic Press, Inc., NY, 1982), and
Streptomyces promoters (Ward et al., Mol. Gen. Genet. 203:468-478, 1986). Prokaryotic
promoters are reviewed by Glick (Ind. Microbiol. 1:277-282, 1987), Cenatiempo
(Biochimie 68:505-516, 1986), and Gottesman (Ann. Rev. Genet. 18:415-442, 1984).

Proper expression in a prokaryotic cell also requires the presence of a ribosome-binding
site upstream of the gene sequence-encoding sequence. Such ribosome-binding
sites are disclosed, for example, by Gold et al. (Ann. Rev. Microbiol. 35:365-404, 1981).
The selection of control sequences, expression vectors, transformation methods, and the
like, are dependent on the type of host cell used to express the gene. As used herein,
"cell", "cell line", and "cell culture" may be used interchangeably and all such
designations include progeny. Thus, the words "transformants" or "transformed cells"
include the primary subject cell and cultures derived therefrom, without regard to the
number of transfers. It is also understood that all progeny may not be precisely identical
in DNA content, due to deliberate or inadvertent mutations. However, as defined, mutant
progeny have the same functionality as that of the originally transformed cell.

Host cells which may be used in the expression systems of the present invention
are not strictly limited, provided that they are suitable for use in the expression of the
kinase polypeptide of interest. Suitable hosts may often include eukaryotic cells.
Preferred eukaryotic hosts include, for example, yeast, fungi, insect cells, mammalian cells
either in vivo, or in tissue culture. Mammalian cells which may be useful as hosts include
HeLa cells, cells of fibroblast origin such as VERO or CHO-K1, or cells of lymphoid
origin and their derivatives. Preferred mammalian host cells include SP2/0 and J558L, as
well as neuroblastoma cell lines such as IMR 332, which may provide better capacities for
correct post-translational processing.

In addition, plant cells are also available as hosts, and control sequences
compatible with plant cells are available, such as the cauliflower mosaic virus 35S and
19S, and nopaline synthase promoter and polyadenylation signal sequences. Another
preferred host is an insect cell, for example the Drosophila larvae. Using insect cells as
hosts, the Drosophila alcohol dehydrogenase promoter can be used (Rubin, Science
240:1453-1459, 1988). Alternatively, baculovirus vectors can be engineered to express
large amounts of kinases of the invention in insect cells (Jasny, Science 238:1653, 1987;
Miller et al., In: Genetic Engineering, Vol. 8, Plenum, Setlow et al., eds., pp. 277-297,
1986).

Any of a series of yeast expression systems can be utilized which incorporate
promoter and termination elements from the actively expressed sequences coding for
glycolytic enzymes that are produced in large quantities when yeast are grown in media
rich in glucose. Known glycolytic gene sequences can also provide very efficient
transcriptional control signals. Yeast provides substantial advantages in that it can also
carry out post-translational modifications. A number of recombinant DNA strategies exist
utilizing strong promoter sequences and high copy number plasmids which can be utilized
for production of the desired proteins in yeast. Yeast recognizes leader sequences on
cloned mammalian genes and secretes peptides bearing leader sequences (i.e.,
pre-peptides). Several possible vector systems are available for the expression of kinases of
the invention in a mammalian host.

A wide variety of transcriptional and translational regulatory sequences may be
employed, depending upon the nature of the host. The transcriptional and translational
regulatory signals may be derived from viral sources, such as adenovirus, bovine
papilloma virus, cytomegalovirus, simian virus, or the like, where the regulatory signals
are associated with a particular gene sequence which has a high level of expression.
Alternatively, promoters from mammalian expression products, such as actin, collagen,
myosin, and the like, may be employed. Transcriptional initiation regulatory signals may
be selected which allow for repression or activation, so that expression of the gene
sequences can be modulated. Of interest are regulatory signals which are
temperature-sensitive so that by varying the temperature, expression can be repressed or initiated, or
are subject to chemical (such as metabolite) regulation.

Expression of kinases of the invention in eukaryotic hosts requires the use of
eukaryotic regulatory regions. Such regions will, in general, include a promoter region
sufficient to direct the initiation of RNA synthesis. Preferred eukaryotic promoters
include, for example, the promoter of the mouse metallothionein I gene sequence (Hamer
et al., J. Mol. Appl. Gen. 1:273-288, 1982); the TK promoter of Herpes virus (McKnight,
Cell 31:355-365, 1982); the SV40 early promoter (Benoist et al., Nature (London)
290:304-310, 1981); and the yeast gal4 gene sequence promoter (Johnston et al., Proc. Natl.
Acad. Sci. (USA) 79:6971-6975, 1982; Silver et al., Proc. Natl. Acad. Sci. (USA)
81:5951-5955, 1984).

Translation of eukaryotic mRNA is initiated at the codon that encodes the first
methionine. For this reason, it is preferable to ensure that the linkage between a
eukaryotic promoter and a DNA sequence which encodes a kinase of the invention (or a
functional derivative thereof) does not contain any intervening codons which are capable
of encoding a methionine (i.e., AUG). The presence of such codons results either in the
formation of a fusion protein (if the AUG codon is in the same reading frame as the kinase
of the invention coding sequence) or a frame-shift mutation (if the AUG codon is not in
the same reading frame as the kinase of the invention coding sequence).
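
The check described above can be sketched as a short scan of the linking region. The following is an illustrative sketch only: the function name `upstream_start_codons` and the example leader sequences are hypothetical, and the leader is assumed to end immediately before the intended initiation codon of the kinase coding sequence. Each upstream AUG (ATG in DNA notation) is reported together with whether it lies in the same reading frame as the coding sequence. The sketch does not look for stop codons between an upstream AUG and the coding sequence, which would also affect the outcome.

```python
def upstream_start_codons(leader):
    """Find ATG/AUG triplets in the region upstream of the intended start codon.

    `leader` is the sequence between the promoter and the intended initiation
    codon (exclusive), so the coding sequence starts at index len(leader).
    Returns a list of (position, frame_label) tuples.
    """
    leader = leader.upper().replace("U", "T")
    n = len(leader)
    hits = []
    for i in range(n - 2):
        if leader[i:i + 3] == "ATG":
            # In frame when the distance to the intended start is a multiple of 3.
            in_frame = (n - i) % 3 == 0
            hits.append((i, "in-frame" if in_frame else "out-of-frame"))
    return hits


# Hypothetical leader sequences for illustration.
print(upstream_start_codons("GCCATGGCA"))   # upstream ATG in frame
print(upstream_start_codons("GCATGGC"))     # upstream ATG out of frame
print(upstream_start_codons("GCCACC"))      # no upstream ATG
```

An in-frame hit corresponds to the fusion-protein case described above, and an out-of-frame hit to the frame-shift case; an empty result indicates that the linkage contains no intervening methionine codon.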

A nucleic acid molecule encoding a kinase of the invention and an operably linked 
promoter may be introduced into a recipient prokaryotic or eukaryotic cell either as a 
nonreplicating DNA or RNA molecule, which may either be a linear molecule or, more 
preferably, a closed covalent circular molecule. Since such molecules are incapable of 
autonomous replication, the expression of the gene may occur through the transient 
expression of the introduced sequence. Alternatively, permanent expression may occur 
through the integration of the introduced DNA sequence into the host chromosome. 

A vector may be employed which is capable of integrating the desired gene 
sequences into the host cell chromosome. Cells which have stably integrated the 
introduced DNA into their chromosomes can be selected by also introducing one or more 
markers which allow for selection of host cells which contain the expression vector. The 
marker may provide for prototrophy to an auxotrophic host, biocide resistance, e.g., 
antibiotics, or heavy metals, such as copper, or the like. The selectable marker gene 
sequence can either be directly linked to the DNA gene sequences to be expressed, or 
introduced into the same cell by co-transfection. Additional elements may also be needed 
for optimal synthesis of mRNA. These elements may include splice signals, as well as 
transcription promoters, enhancers, and termination signals. cDNA expression vectors 
incorporating such elements include those described by Okayama (Mol. Cell. Biol. 3:280-, 
1983). 

The introduced nucleic acid molecule can be incorporated into a plasmid or viral 
vector capable of autonomous replication in the recipient host. Any of a wide variety of 
vectors may be employed for this purpose. Factors of importance in selecting a particular 
plasmid or viral vector include: the ease with which recipient cells that contain the vector 
may be recognized and selected from those recipient cells which do not contain the vector; 
the number of copies of the vector which are desired in a particular host; and whether it is
desirable to be able to "shuttle" the vector between host cells of different species.

Preferred prokaryotic vectors include plasmids such as those capable of replication 
in E. coli (such as, for example, pBR322, ColE1, pSC101, pACYC 184, πVX; "Molecular
Cloning: A Laboratory Manual", 1989, supra). Bacillus plasmids include pC194, pC221,
pT127, and the like (Gryczan, In: The Molecular Biology of the Bacilli, Academic Press,
NY, pp. 307-329, 1982). Suitable Streptomyces plasmids include pIJ101 (Kendall et al.,
J. Bacteriol. 169:4177-4183, 1987), and Streptomyces bacteriophages such as φC31
(Chater et al., In: Sixth International Symposium on Actinomycetales Biology, Akademiai
Kaido, Budapest, Hungary, pp. 45-54, 1986). Pseudomonas plasmids are reviewed by
John et al. (Rev. Infect. Dis. 8:693-704, 1986), and Izaki (Jpn. J. Bacteriol. 33:729-742,
1978). 

Preferred eukaryotic plasmids include, for example, BPV, vaccinia, SV40, 2-micron
circle, and the like, or their derivatives. Such plasmids are well known in the art
(Botstein et al., Miami Wntr. Symp. 19:265-274, 1982; Broach, In: "The Molecular
Biology of the Yeast Saccharomyces: Life Cycle and Inheritance", Cold Spring Harbor
Laboratory, Cold Spring Harbor, NY, p. 445-470, 1981; Broach, Cell 28:203-204, 1982;
Bollon et al., J. Clin. Hematol. Oncol. 10:39-48, 1980; Maniatis, In: Cell Biology: A
Comprehensive Treatise, Vol. 3, Gene Sequence Expression, Academic Press, NY, pp. 
563-608, 1980). 

Once the vector or nucleic acid molecule containing the construct(s) has been 
prepared for expression, the DNA construct(s) may be introduced into an appropriate host
cell by any of a variety of suitable means, i.e., transformation, transfection, conjugation, 
protoplast fusion, electroporation, particle gun technology, calcium phosphate- 
precipitation, direct microinjection, and the like. After the introduction of the vector, 
recipient cells are grown in a selective medium, which selects for the growth of vector- 
containing cells. Expression of the cloned gene(s) results in the production of a kinase of 
the invention, or fragments thereof. This can take place in the transformed cells as such, 
or following the induction of these cells to differentiate (for example, by administration of 
bromodeoxyuracil to neuroblastoma cells or the like). A variety of incubation conditions
can be used to form the peptide of the present invention. The most preferred conditions
are those which mimic physiological conditions.

IV. The Proteins of the Invention

A variety of methodologies known in the art can be utilized to obtain the 
polypeptides of the present invention. The polypeptides may be purified from tissues or 
cells that naturally produce the polypeptides. Alternatively, the above-described isolated 
nucleic acid fragments could be used to express the kinases of the invention in any 
organism. The samples of the present invention include cells, protein extracts or 
membrane extracts of cells, or biological fluids. The samples will vary based on the assay 
format, the detection method, and the nature of the tissues, cells or extracts used as the 
sample. 

Any eukaryotic organism can be used as a source for the polypeptides of the 
invention, as long as the source organism naturally contains such polypeptides. As used 
herein, "source organism" refers to the original organism from which the amino acid 
sequence of the subunit is derived, regardless of the organism the subunit is expressed in 
and ultimately isolated from. 

One skilled in the art can readily follow known methods for isolating proteins in 
order to obtain the polypeptides free of natural contaminants. These include, but are not 
limited to: size-exclusion chromatography, HPLC, ion-exchange chromatography, and 
immuno-affinity chromatography. 

Further, the polypeptides of the invention include the full-length polypeptides that 
can be identified from the full-length or partial sequences encoded by SEQ ID NO:122, 
SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, 
SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, 
SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, 
SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, 
SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, 
SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, 
SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, 
SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, 
SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, 
SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, 
SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, 
SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182, 
SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187, 
SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:192, 
SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, 
SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, 
SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, 
SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212, 
SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, 
SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, 
SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, 
SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, 
SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, 
SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID 
NO:242 (Figure 1). In addition, the polypeptides of the invention include the domains of 
these polypeptides, including, but not limited to, the N-terminal, kinase/catalytic, and 
C-terminal domains. 

The characteristics of the protein kinase nucleic acid sequences of the invention are 
provided in Table 1. The protein kinases fall into 10 known groups: AGC, CAMK, CKI, 
CMGC, dsPK, EEFK, LIMK, MLK, STE and TK. In addition, there are a significant 
number of protein kinases that do not belong to any of the known groups, and therefore 
presumably define new protein kinase groups. 

Additional characteristics are shown in, inter alia, the tables, namely Table 1, 
Table 2, Table 3 and Table 4, provided below. 

V. Antibodies, Hybridomas, Methods of Use and Kits for Detection of Protein Kinases 

The present invention relates to an antibody having binding affinity to a kinase of 
the invention. The polypeptide may have an amino acid sequence selected from the group 
consisting of those set forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, 
SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, 
SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, 
SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, 
SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, 
SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, 
SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, 
SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, 
SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, 
SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, 
SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, 
SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, 
SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, 
SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, 
SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:192, SEQ ID NO:193, SEQ ID NO:194, 
SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, 
SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, 
SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, 
SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, 
SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, 
SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, 
SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, 
SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, 
SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, 
SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242, or a functional derivative thereof, or 
at least 9 contiguous amino acids thereof (preferably, at least 20, 30, 35, or 40 or more 
contiguous amino acids thereof). Alternatively, the antibody may bind to a part of the 
polypeptide not provided in the sequences above, but that is present in the full-length 
sequence of the polypeptide and that is easily obtained using methods standard in the art. 
Further, the antibody may bind specifically to particular domains of one or more of the 
kinases of the invention, including, but not limited to, the N-terminal, kinase/catalytic, or 
C-terminal domains. 
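
As an illustration of the "at least 9 contiguous amino acids" language above, the following minimal Python sketch enumerates every contiguous window of a chosen length from a polypeptide sequence. The function name, the default window length, and the example sequence are hypothetical and are not taken from the sequence listing; the sketch only shows how candidate fragments might be listed, not how an immunogen would actually be chosen.

```python
def contiguous_fragments(sequence: str, length: int = 9) -> list[str]:
    """Return every contiguous fragment of `length` residues, in order.

    `sequence` is a one-letter amino acid string. A sequence shorter than
    `length` yields an empty list.
    """
    if length <= 0:
        raise ValueError("length must be a positive integer")
    # Slide a fixed-size window along the sequence, one residue at a time.
    return [sequence[i:i + length] for i in range(len(sequence) - length + 1)]


# Hypothetical 11-residue example (not a sequence of the invention).
example = "ABCDEFGHIJK"
fragments = contiguous_fragments(example, 9)
print(fragments)
```

Passing 20, 30, 35, or 40 as the length gives the longer preferred fragments in the same way.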

The present invention also relates to an antibody having specific binding affinity to 
a kinase or kinase domain of the invention. Such an antibody may be isolated by 
comparing its binding affinity to a kinase of the invention with its binding affinity to other 
polypeptides. Those that bind selectively to a kinase of the invention would be chosen for 
use in methods requiring a distinction between a kinase of the invention and other 
polypeptides. Such methods could include, but should not be limited to, the analysis of 
altered kinase expression in tissue containing other polypeptides. 

The kinases of the present invention can be used in a variety of procedures and 
methods, such as for the generation of antibodies, for use in identifying pharmaceutical 
compositions, and for studying DNA/protein interaction. 

The kinases of the present invention can be used to produce antibodies or 
hybridomas. One skilled in the art will recognize that if an antibody is desired, such a 
peptide could be generated as described herein and used as an immunogen. The antibodies 
of the present invention include monoclonal and polyclonal antibodies, as well as fragments 
of these antibodies, and humanized forms. Humanized forms of the antibodies of the 
present invention may be generated using one of the procedures known in the art such as 
chimerization or CDR grafting. 

The present invention also relates to a hybridoma that produces the above- 
described monoclonal antibody, or binding fragment thereof. A hybridoma is an 
immortalized cell line that is capable of secreting a specific monoclonal antibody. 

In general, techniques for preparing monoclonal antibodies and hybridomas are 
well known in the art (Campbell, "Monoclonal Antibody Technology: Laboratory 
Techniques in Biochemistry and Molecular Biology," Elsevier Science Publishers, 
Amsterdam, The Netherlands, 1984; St. Groth et al., J. Immunol. Methods 35:1-21, 1980). 
Any animal (mouse, rabbit, and the like) which is known to produce antibodies can be 
immunized with the selected polypeptide. Methods for immunization are well known in 
the art. Such methods include subcutaneous or intraperitoneal injection of the 
polypeptide. One skilled in the art will recognize that the amount of polypeptide used for 
immunization will vary based on the animal that is immunized, the antigenicity of the 
polypeptide and the site of injection. 

The polypeptide may be modified or administered in an adjuvant in order to 
increase the peptide antigenicity. Methods of increasing the antigenicity of a polypeptide 
are well known in the art. Such procedures include coupling the antigen with a 
heterologous protein (such as globulin or β-galactosidase) or through the inclusion of an 
adjuvant during immunization. 




For monoclonal antibodies, spleen cells from the immunized animals are removed, 
fused with myeloma cells, such as SP2/0-Ag14 myeloma cells, and allowed to become 
monoclonal antibody producing hybridoma cells. Any one of a number of methods well 
known in the art can be used to identify the hybridoma cell that produces an antibody with 
the desired characteristics. These include screening the hybridomas with an ELISA assay, 
western blot analysis, or radioimmunoassay (Lutz et al., Exp. Cell Res. 175:109-124, 
1988). Hybridomas secreting the desired antibodies are cloned and the class and subclass 
are determined using procedures known in the art (Campbell, "Monoclonal Antibody 
Technology: Laboratory Techniques in Biochemistry and Molecular Biology", supra, 
1984). 

For polyclonal antibodies, antibody-containing antiserum is isolated from the 
immunized animal and is screened for the presence of antibodies with the desired 
specificity using one of the above-described procedures. The above-described antibodies 
may be detectably labeled. Antibodies can be detectably labeled through the use of 
radioisotopes, affinity labels (such as biotin, avidin, and the like), enzymatic labels (such 
as horseradish peroxidase, alkaline phosphatase, and the like), fluorescent labels (such as 
FITC or rhodamine, and the like), paramagnetic atoms, and the like. Procedures for 
accomplishing such labeling are well-known in the art, for example, see Sternberger et al., 
J. Histochem. Cytochem. 18:315, 1970; Bayer et al., Meth. Enzym. 62:308-, 1979; Engval 
et al., Immunol. 109:129-, 1972; Goding, J. Immunol. Meth. 13:215-, 1976. The labeled 
antibodies of the present invention can be used for in vitro, in vivo, and in situ assays to 
identify cells or tissues that express a specific peptide. 

The above-described antibodies may also be immobilized on a solid support. 
Examples of such solid supports include plastics such as polycarbonate, complex 
carbohydrates such as agarose and sepharose, acrylic resins such as polyacrylamide, 
and latex beads. Techniques for coupling antibodies to such solid supports are well known 
in the art (Weir et al., "Handbook of Experimental Immunology" 4th Ed., Blackwell 
Scientific Publications, Oxford, England, Chapter 10, 1986; Jacoby et al., Meth. Enzym. 
34, Academic Press, N.Y., 1974). The immobilized antibodies of the present invention 
can be used for in vitro, in vivo, and in situ assays as well as in immunochromatography. 

Furthermore, one skilled in the art can readily adapt currently available procedures, 
as well as the techniques, methods and kits disclosed herein with regard to antibodies, to 
generate peptides capable of binding to a specific peptide sequence in order to generate 
rationally designed antipeptide peptides (Hurby et al., "Application of Synthetic Peptides: 
Antisense Peptides", In Synthetic Peptides, A User's Guide, W.H. Freeman, NY, pp. 289- 
307, 1992; Kaspczak et al., Biochemistry 28:9230-9238, 1989). 

Anti-peptide peptides can be generated by replacing the basic amino acid residues 
found in the peptide sequences of the kinases of the invention with acidic residues, while 
maintaining hydrophobic and uncharged polar groups. For example, lysine, arginine, 
and/or histidine residues are replaced with aspartic acid or glutamic acid and glutamic acid 
residues are replaced by lysine, arginine or histidine. 
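
The substitution rule described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not a design procedure: the text allows any acidic residue to replace a basic one and any basic residue to replace glutamic acid, so the specific one-to-one pairs chosen below, the function name, and the example peptide are all hypothetical. Aspartic acid is left unchanged because the passage names only glutamic acid as being replaced.

```python
# Assumed pairings (hypothetical): the text permits D or E for K/R/H,
# and K, R or H for E. One fixed choice is made here for reproducibility.
BASIC_TO_ACIDIC = {"K": "E", "R": "D", "H": "D"}
ACIDIC_TO_BASIC = {"E": "K"}


def anti_peptide(sequence: str) -> str:
    """Swap charged residues; keep hydrophobic and uncharged polar residues."""
    result = []
    for residue in sequence.upper():
        if residue in BASIC_TO_ACIDIC:
            result.append(BASIC_TO_ACIDIC[residue])   # basic -> acidic
        elif residue in ACIDIC_TO_BASIC:
            result.append(ACIDIC_TO_BASIC[residue])   # glutamic acid -> basic
        else:
            result.append(residue)                     # maintained as-is
    return "".join(result)


# Hypothetical peptide, not taken from the sequence listing.
print(anti_peptide("AKRHEL"))
```

A different choice among the permitted residues (for example, arginine in place of lysine for glutamic acid) only requires editing the two dictionaries.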

The present invention also encompasses a method of detecting a kinase 
polypeptide in a sample, comprising: (a) contacting the sample with an above-described 
antibody, under conditions such that immunocomplexes form, and (b) detecting the 
presence of said antibody bound to the polypeptide. In detail, the methods comprise 
incubating a test sample with one or more of the antibodies of the present invention and 
assaying whether the antibody binds to the test sample. Altered levels of a kinase of the 
invention in a sample as compared to normal levels may indicate disease. 

Conditions for incubating an antibody with a test sample vary. Incubation 
conditions depend on the format employed in the assay, the detection methods employed, 
and the type and nature of the antibody used in the assay. One skilled in the art will 
recognize that any one of the commonly available immunological assay formats (such as 
radioimmunoassays, enzyme-linked immunosorbent assays, diffusion based Ouchterlony, 
or rocket immunofluorescent assays) can readily be adapted to employ the antibodies of 
the present invention. Examples of such assays can be found in Chard ("An Introduction 
to Radioimmunoassay and Related Techniques" Elsevier Science Publishers, Amsterdam, 
The Netherlands, 1986), Bullock et al. ("Techniques in Immunocytochemistry," Academic 
Press, Orlando, FL Vol. 1, 1982; Vol. 2, 1983; Vol. 3, 1985), Tijssen ("Practice and 
Theory of Enzyme Immunoassays: Laboratory Techniques in Biochemistry and Molecular 
Biology," Elsevier Science Publishers, Amsterdam, The Netherlands, 1985). 

The immunological assay test samples of the present invention include cells, 
protein or membrane extracts of cells, or biological fluids such as blood, serum, plasma, or 
urine. The test samples used in the above-described method will vary based on the assay 
format, nature of the detection method and the tissues, cells or extracts used as the sample 
to be assayed. Methods for preparing protein extracts or membrane extracts of cells are 
well known in the art and can readily be adapted in order to obtain a sample which is 
testable with the system utilized. 

A kit contains all the necessary reagents to carry out the previously described 
methods of detection. The kit may comprise: (i) a first container means containing an 
above-described antibody, and (ii) second container means containing a conjugate 
comprising a binding partner of the antibody and a label. In another preferred 
embodiment, the kit further comprises one or more other containers comprising one or 
more of the following: wash reagents and reagents capable of detecting the presence of 
bound antibodies. 

Examples of detection reagents include, but are not limited to, labeled secondary 
antibodies, or in the alternative, if the primary antibody is labeled, the chromophoric, 
enzymatic, or antibody binding reagents that are capable of reacting with the labeled 
antibody. The compartmentalized kit may be as described above for nucleic acid probe 
kits. One skilled in the art will readily recognize that the antibodies described in the 
present invention can readily be incorporated into one of the established kit formats that 
are well known in the art. 

VI. Isolation of Compounds That Interact With Protein Kinases 

The present invention also relates to a method of detecting a compound capable of 
binding to a protein kinase of the invention, comprising incubating the compound with a 
kinase of the invention and detecting the presence of the compound bound to the kinase. 
The compound may be present within a complex mixture, for example, serum, body fluid, 
or cell extracts. 

The present invention also relates to a method of detecting an agonist or antagonist 
of kinase activity or kinase binding partner activity comprising incubating cells that 
produce a kinase of the invention in the presence of a compound and detecting changes in 
the level of kinase activity or kinase binding partner activity. The compounds thus 
identified would produce a change in activity indicative of the presence of the compound. 
The compound may be present within a complex mixture, for example, serum, body fluid, 
or cell extracts. Once the compound is identified it can be isolated using techniques well 
known in the art. 

The present invention also encompasses a method of agonizing (stimulating) or 
antagonizing kinase associated activity in a mammal comprising administering to said 
mammal an agonist or antagonist to a kinase of the invention in an amount sufficient to 
effect said agonism or antagonism. A method of treating diseases in a mammal with an 
agonist or antagonist of kinase activity comprising administering the agonist or antagonist 
to a mammal in an amount sufficient to agonize or antagonize kinase associated functions 
is also encompassed in the present application. 

In an effort to discover novel treatments for diseases, biomedical researchers and 
chemists have designed, synthesized, and tested molecules that inhibit the function of 
protein kinases. Some small organic molecules form a class of compounds that modulate 
the function of protein kinases. Examples of molecules that have been reported to inhibit 
the function of protein kinases include, but are not limited to, bis monocyclic, bicyclic or 
heterocyclic aryl compounds (PCT WO 92/20642, published November 26, 1992 by 
Maguire et al.), vinylene-azaindole derivatives (PCT WO 94/14808, published July 7, 
1994 by Ballinari et al.), 1-cyclopropyl-4-pyridyl-quinolones (U.S. Patent No. 5,330,992), 
styryl compounds (U.S. Patent No. 5,217,999), styryl-substituted pyridyl compounds (U.S. 
Patent No. 5,302,606), certain quinazoline derivatives (EP Application No. 0 566 266 A1), 
selenoindoles and selenides (PCT WO 94/03427, published February 17, 1994 by Denny et 
al.), tricyclic polyhydroxylic compounds (PCT WO 92/21660, published December 10, 
1992 by Dow), and benzylphosphonic acid compounds (PCT WO 91/15495, published 
October 17, 1991 by Dow et al.). 

Compounds that can traverse cell membranes and are resistant to acid hydrolysis 
are potentially advantageous as therapeutics as they can become highly bioavailable after 
being administered orally to patients. However, many of these protein kinase inhibitors 
only weakly inhibit the function of protein kinases. In addition, many inhibit a variety of 
protein kinases and will cause multiple side-effects as therapeutics for diseases. 

Some indolinone compounds, however, form classes of acid resistant and 
membrane permeable organic molecules. WO 96/22976 (published August 1, 1996 by 
Ballinari et al.) describes hydrosoluble indolinone compounds that harbor tetralin, 
naphthalene, quinoline, and indole substituents fused to the oxindole ring. These bicyclic 
substituents are in turn substituted with polar moieties including hydroxylated alkyl, 
phosphate, and ether moieties. U.S. Patent Application Serial Nos. 08/702,232, filed 
August 23, 1996, entitled "Indolinone Combinatorial Libraries and Related Products and 
Methods for the Treatment of Disease" by Tang et al. (Lyon & Lyon Docket No. 221/187) 
and 08/485,323, filed June 7, 1995, entitled "Benzylidene-Z-Indoline Compounds for the 
Treatment of Disease" by Tang et al. (Lyon & Lyon Docket No. 223/298) and 
International Patent Publication WO 96/22976, published August 1, 1996 by Ballinari et 
al., all of which are incorporated herein by reference in their entirety, including any 
drawings, describe indolinone chemical libraries of indolinone compounds harboring other 
bicyclic moieties as well as monocyclic moieties fused to the oxindole ring. Applications 
08/702,232, filed August 23, 1996, entitled "Indolinone Combinatorial Libraries and 
Related Products and Methods for the Treatment of Disease" by Tang et al. (Lyon & Lyon 
Docket No. 221/187), 08/485,323, filed June 7, 1995, entitled "Benzylidene-Z-Indoline 
Compounds for the Treatment of Disease" by Tang et al. (Lyon & Lyon Docket No. 
223/298), and WO 96/22976, published August 1, 1996 by Ballinari et al. teach methods 
of indolinone synthesis, methods of testing the biological activity of indolinone 
compounds in cells, and inhibition patterns of indolinone derivatives. 

Other examples of substances capable of modulating kinase activity include, but 
are not limited to, tyrphostins, quinazolines, quinoxalines, and quinolines. The 
quinazolines, tyrphostins, quinolines, and quinoxalines referred to above include well 
known compounds such as those described in the literature. For example, representative 
publications describing quinazolines include Barker et al., EPO Publication No. 0 520 722 
A1; Jones et al., U.S. Patent No. 4,447,608; Kabbe et al., U.S. Patent No. 4,757,072; Kaul 
and Vougioukas, U.S. Patent No. 5,316,553; Kreighbaum and Comer, U.S. Patent No. 
4,343,940; Pegg and Wardleworth, EPO Publication No. 0 562 734 A1; Barker et al., 
Proc. of Am. Assoc. for Cancer Research 32:327 (1991); Bertino, J.R., Cancer Research 
3:293-304 (1979); Bertino, J.R., Cancer Research 9(2 part 1):293-304 (1979); Curtin et 
al., Br. J. Cancer 53:361-368 (1986); Fernandes et al., Cancer Research 43:1117-1123 
(1983); Ferris et al., J. Org. Chem. 44(2):173-178; Fry et al., Science 265:1093-1095 
(1994); Jackman et al., Cancer Research 51:5579-5586 (1981); Jones et al., J. Med. Chem. 
29(6):1114-1118; Lee and Skibo, Biochemistry 26(23):7355-7362 (1987); Lemus et al., J. 
Org. Chem. 54:3511-3518 (1989); Ley and Seng, Synthesis 1975:415-522 (1975); 
Maxwell et al., Magnetic Resonance in Medicine 17:189-196 (1991); Mini et al., Cancer 
Research 45:325-330 (1985); Phillips and Castle, J. Heterocyclic Chem. 17(19):1489-1596 
(1980); Reece et al., Cancer Research 47(11):2996-2999 (1977); Sculier et al., Cancer 
Immunol. and Immunother. 23:A65 (1986); Sikora et al., Cancer Letters 23:289-295 
(1984); Sikora et al., Analytical Biochem. 172:344-355 (1988); all of which are 
incorporated herein by reference in their entirety, including any drawings. 

Quinoxaline is described in Kaul and Vougioukas, U.S. Patent No. 5,316,553, 
incorporated herein by reference in its entirety, including any drawings. 

Quinolines are described in Dolle et al., J. Med. Chem. 37:2627-2629 (1994); 
MaGuire, J. Med. Chem. 37:2129-2131 (1994); Burke et al., J. Med. Chem. 36:425-432 
(1993); and Burke et al., BioOrganic Med. Chem. Letters 2:1771-1774 (1992), all of which 
are incorporated by reference in their entirety, including any drawings. 

Tyrphostins are described in Allen et al., Clin. Exp. Immunol. 91:141-156 (1993); 
Anafi et al., Blood 82:12:3524-3529 (1993); Baker et al., J. Cell Sci. 102:543-555 (1992); 
Bilder et al., Amer. Physiol. Soc. pp. 6363-6143:C721-C730 (1991); Brunton et al., 
Proceedings of Amer. Assoc. Cancer Rsch. 33:558 (1992); Bryckaert et al., Experimental 
Cell Research 199:255-261 (1992); Dong et al., J. Leukocyte Biology 53:53-60 (1993); 
Dong et al., J. Immunol. 151(5):2717-2724 (1993); Gazit et al., J. Med. Chem. 32:2344- 
2352 (1989); Gazit et al., J. Med. Chem. 36:3556-3564 (1993); Kaur et al., Anti-Cancer 
Drugs 5:213-222 (1994); Kaur et al., King et al., Biochem. J. 275:413-418 (1991); Kuo et 
al., Cancer Letters 74:197-202 (1993); Levitzki, A., The FASEB J. 6:3275-3282 (1992); 
Lyall et al., J. Biol. Chem. 264:14503-14509 (1989); Peterson et al., The Prostate 22:335- 
345 (1993); Pillemer et al., Int. J. Cancer 50:80-85 (1992); Posner et al., Molecular 
Pharmacology 45:673-683 (1993); Rendu et al., Biol. Pharmacology 44(5):881-888 
(1992); Sauro and Thomas, Life Sciences 53:371-376 (1993); Sauro and Thomas, J. 
Pharm. and Experimental Therapeutics 267(3):119-1125 (1993); Wolbring et al., J. Biol. 
Chem. 269(36):22470-22472 (1994); and Yoneda et al., Cancer Research 51:4430-4435 
(1991); all of which are incorporated herein by reference in their entirety, including any 
drawings. 

Other compounds that could be used as modulators include oxindolinones such as 
those described in U.S. patent application Serial No. 08/702,232 filed August 23, 1996, 
incorporated herein by reference in its entirety, including any drawings. 

VII. Biological Significance, Applications and Clinical Relevance of Novel Protein Kinases 

For each protein kinase in this application, we provide a classification of the 
protein class and family to which it belongs, a summary of non-catalytic protein motifs, a 
profile of its expression in several hundred tissue and cell sources, and a chromosomal 
location. This information can be used to suggest potential function, regulation or 
therapeutic utility for each of the proteins. 

The kinase classification and protein domains often reflect pathways, cellular roles, 
or mechanisms of up- or down-stream regulation. Also, disease-relevant genes often occur 
in families of related genes. For example, if one member of a kinase family functions as an 
oncogene, a tumor suppressor, or has been found to be disrupted in an immune, 
neurologic, cardiovascular, or metabolic disorder, frequently other family members may 
play a related role. 

The expression analysis organizes kinases into groups that are transcriptionally 
upregulated in tumors and those that are more restricted to specific tumor types such as 
melanoma or prostate. This analysis also identifies genes that are regulated in a cell cycle 
dependent manner, and are therefore likely to be involved in maintaining cell cycle 
checkpoints, entry, progression, or exit from mitosis, oversee DNA repair, or are involved 
in cell proliferation and genome stability. Expression data also can identify genes 
expressed in endothelial sources or other tissues that suggest a role in angiogenesis, 
thereby implicating them as targets for control of diseases that have an angiogenic 
component, such as cancer, endometriosis, retinopathy and macular degeneration, and 
various ischemic or vascular pathologies. A protein's role in cell survival can also be 
suggested based on restricted expression in cells subjected to external stress such as 
oxidative damage, hypoxia, drugs such as cisplatinum, or irradiation. 
Metastases-associated genes can be implicated when expression is restricted to invading regions of a 
tumor, or is only seen in local or distant metastases compared to the primary tumor, or 
when a gene is upregulated during cell culture models of invasion, migration, or motility. 




Chromosomal location can identify candidate targets for a tumor amplicon or a 
tumor-suppressor locus. Summaries of prevalent tumor amplicons are available in the 
literature, and can identify tumor types to experimentally be confirmed to contain 
amplified copies of a kinase gene which localizes to an adjacent region. 

Based on these criteria several kinases immediately stand out as being of potential 
therapeutic relevance. The protein kinases can be divided into the following disease- 
relevant categories (nucleotide Seq ID #s in parentheses): 

Tumor associated: Mok (SEQ ID NO:57), EPK2, AA316804 (SEQ ID 
NO:11), AA435956 (SEQ ID NO:48), AA278842 (SEQ ID NO:8S), AA599286 (SEQ 
ID NO:89), AA826850 (SEQ ID NO:3), HRJ (SEQ ID NO:73), MLK4 AA232253 (SEQ 
ID NO:82), AA883975 SGK 235 (SEQ ID NO:95), AA311714 (SEQ ID NO:101), 
MPSK1 (SEQ ID NO:110), R19609 (SEQ ID NO:111), AA383293 (SEQ ID NO:26). 

Prostate-specific: AA234451 (SEQ ID NO:47), TSK4 (SEQ ID NO:93), RIP4 
(SEQ ID NO:84), KIAA0965 (SEQ ID NO:8). 

Oncogenic or proliferation associated: KIAA0781 (SEQ ID NO:38), AA789239 
(SEQ ID NO:52), CCRK (SEQ ID NO:54), CLK4 (SEQ ID NO:55), H85389 (SEQ ID 
NO:97). 

Neuronal restricted: CAMKKB (SEQ ID NO:66) 

Hematopoietic expressed: PTK9L (SEQ ID NO:22), DRAK2 (SEQ ID NO:29), 
AI025291 (SEQ ID NO:94) 

Angiogenic or endothelial expressed: DRAK1 (SEQ ID NO:31), MAK-V (SEQ 
ID NO:40), TRAD (SEQ ID NO:44), MOK (SEQ ID NO:57), AA08847 (SEQ ID NO:78), 
HGP__66444466 (SEQ ID NO:79), RSK4 (SEQ ID NO: 16). 

Cell cycle regulated: AA454060 (SEQ ID NO:45), KIAA0999 (Mitotic - SEQ ID 
NO:32), AA579641 (Mitotic - SEQ ID NO:60), AA305176 (Mitotic - SEQ ID NO:6), 
AA018361 (SI phase - SEQ ID NO:100). 
VIII. Transgenic Animals. 

A variety of methods are available for the production of transgenic animals 
associated with this invention. DNA can be injected into the pronucleus of a fertilized egg 
before fusion of the male and female pronuclei, or injected into the nucleus of an 
embryonic cell (e.g., the nucleus of a two-cell embryo) following the initiation of cell 
division (Brinster et al., Proc. Nat. Acad. Sci. USA 82:4438-4442, 1985). Embryos can 
be infected with viruses, especially retroviruses, modified to carry inorganic-ion receptor 
nucleotide sequences of the invention. 

Pluripotent stem cells derived from the inner cell mass of the embryo and 
stabilized in culture can be manipulated in culture to incorporate nucleotide sequences of 
the invention. A transgenic animal can be produced from such cells through implantation 
into a blastocyst that is implanted into a foster mother and allowed to come to term. 
Animals suitable for transgenic experiments can be obtained from standard commercial 
sources such as Charles River (Wilmington, MA), Taconic (Germantown, NY), Harlan 
Sprague Dawley (Indianapolis, IN), etc. 

The procedures for manipulation of the rodent embryo and for microinjection of 
DNA into the pronucleus of the zygote are well known to those of ordinary skill in the art 
(Hogan et al., supra). Microinjection procedures for fish, amphibian eggs and birds are 
detailed in Houdebine and Chourrout (Experientia 47:897-905, 1991). Other procedures 
for introduction of DNA into tissues of animals are described in U.S. Patent No. 
4,945,050 (Sanford et al., July 30, 1990). 

By way of example only, to prepare a transgenic mouse, female mice are induced 
to superovulate. Females are placed with males, and the mated females are sacrificed by 
CO2 asphyxiation or cervical dislocation and embryos are recovered from excised 
oviducts. Surrounding cumulus cells are removed. Pronuclear embryos are then washed 
and stored until the time of injection. Randomly cycling adult female mice are paired with 
vasectomized males. Recipient females are mated at the same time as donor females. 
Embryos then are transferred surgically. The procedure for generating transgenic rats is 
similar to that of mice (Hammer et al., Cell 63:1099-1112, 1990). 

Methods for the culturing of embryonic stem (ES) cells and the subsequent 
production of transgenic animals by the introduction of DNA into ES cells using methods 
such as electroporation, calcium phosphate/DNA precipitation and direct injection also are 
well known to those of ordinary skill in the art (Teratocarcinomas and Embryonic Stem 
Cells, A Practical Approach, E.J. Robertson, ed., IRL Press, 1987). 

In cases involving random gene integration, a clone containing the sequence(s) of 
the invention is co-transfected with a gene encoding resistance. Alternatively, the gene 
encoding neomycin resistance is physically linked to the sequence(s) of the invention. 
Transfection and isolation of desired clones are carried out by any one of several methods 
well known to those of ordinary skill in the art (E.J. Robertson, supra). 

DNA molecules introduced into ES cells can also be integrated into the 
chromosome through the process of homologous recombination (Capecchi, Science 244: 
1288-1292, 1989). Methods for positive selection of the recombination event (i.e., neo 
resistance) and dual positive-negative selection (i.e., neo resistance and gancyclovir 
resistance) and the subsequent identification of the desired clones by PCR have been 
described by Capecchi, supra and Joyner et al. (Nature 338: 153-156, 1989), the teachings 
of which are incorporated herein in their entirety including any drawings. The final phase 
of the procedure is to inject targeted ES cells into blastocysts and to transfer the 
blastocysts into pseudopregnant females. The resulting chimeric animals are bred and the 
offspring are analyzed by Southern blotting to identify individuals that carry the transgene. 
Procedures for the production of non-rodent mammals and other animals have been 
discussed by others (Houdebine and Chourrout, supra; Pursel et al., Science 244:1281- 
1288, 1989; and Simms et al., Bio/Technology 6: 179-183, 1988). 

Thus, the invention provides transgenic, nonhuman mammals containing a 
transgene encoding a kinase of the invention or a gene effecting the expression of the 
kinase. Such transgenic nonhuman mammals are particularly useful as an in vivo test 
system for studying the effects of introduction of a kinase, or regulating the expression of 
a kinase (i.e., through the introduction of additional genes, antisense nucleic acids, or 
ribozymes). 

A "transgenic animal" is an animal having cells that contain DNA which has been 
artificially inserted into a cell, which DNA becomes part of the genome of the animal 
which develops from that cell. Preferred transgenic animals are primates, mice, rats, 
cows, pigs, horses, goats, sheep, dogs and cats. The transgenic DNA may encode human 
STE20-related kinases. Native expression in an animal may be reduced by providing an 
amount of anti-sense RNA or DNA effective to reduce expression of the receptor. 

IX. Gene Therapy 

Protein kinases of the invention, or their genetic sequences, will also be useful in 
gene therapy (reviewed in Miller, Nature 357:455-460, 1992). Miller states that advances 
have resulted in practical approaches to human gene therapy that have demonstrated 
positive initial results. The basic science of gene therapy is described in Mulligan 
(Science 260:926-931, 1993). 

In one preferred embodiment, an expression vector containing protein kinase 
coding sequence is inserted into cells, the cells are grown in vitro, and then are infused in 
large numbers into patients. In another preferred embodiment, a DNA segment containing 
a promoter of choice (for example a strong promoter) is transferred into cells containing 
an endogenous gene encoding kinases of the invention in such a manner that the promoter 
segment enhances expression of the endogenous kinase gene (for example, the promoter 
segment is transferred to the cell such that it becomes directly linked to the endogenous 
kinase gene). 

The gene therapy may involve the use of an adenovirus containing kinase cDNA 
targeted to a tumor, systemic kinase increase by implantation of engineered cells, injection 
with kinase-encoding virus, or injection of naked kinase DNA into appropriate tissues. 

Target cell populations may be modified by introducing altered forms of one or 
more components of the protein complexes in order to modulate the activity of such 
complexes. For example, by reducing or inhibiting a complex component activity within 
target cells, an abnormal signal transduction event(s) leading to a condition may be 
decreased, inhibited, or reversed. Deletion or missense mutants of a component, that 
retain the ability to interact with other components of the protein complexes but cannot 
function in signal transduction may be used to inhibit an abnormal, deleterious signal 
transduction event. 

Expression vectors derived from viruses such as retroviruses, vaccinia virus, 
adenovirus, adeno-associated virus, herpes viruses, several RNA viruses, or bovine 
papilloma virus, may be used for delivery of nucleotide sequences (e.g., cDNA) encoding 
recombinant kinase protein of the invention into the targeted cell population (e.g., tumor 
cells). Methods which are well known to those skilled in the art can be used to construct 
recombinant viral vectors containing coding sequences (Maniatis et al., Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, N.Y., 1989; Ausubel et 
al., Current Protocols in Molecular Biology, Greene Publishing Associates and Wiley 
Interscience, N.Y., 1989). Alternatively, recombinant nucleic acid molecules encoding 
protein sequences can be used as naked DNA or in a reconstituted system, e.g., liposomes 
or other lipid systems, for delivery to target cells (e.g., Felgner et al., Nature 337:387-8, 
1989). Several other methods for the direct transfer of plasmid DNA into cells exist for 
use in human gene therapy and involve targeting the DNA to receptors on cells by 
complexing the plasmid DNA to proteins (Miller, supra). 

In its simplest form, gene transfer can be performed by simply injecting minute 
amounts of DNA into the nucleus of a cell, through a process of microinjection (Capecchi, 
Cell 22:479-88, 1980). Once recombinant genes are introduced into a cell, they can be 
recognized by the cell's normal mechanisms for transcription and translation, and a gene 
product will be expressed. Other methods have also been attempted for introducing DNA 
into larger numbers of cells. These methods include: transfection, wherein DNA is 
precipitated with CaPO4 and taken into cells by pinocytosis (Chen et al., Mol. Cell Biol. 
7:2745-52, 1987); electroporation, wherein cells are exposed to large voltage pulses to 
introduce holes into the membrane (Chu et al., Nucleic Acids Res. 15:1311-26, 1987); 
lipofection/liposome fusion, wherein DNA is packaged into lipophilic vesicles which fuse 
with a target cell (Felgner et al., Proc. Natl. Acad. Sci. USA. 84:7413-7417, 1987); and 
particle bombardment using DNA bound to small projectiles (Yang et al., Proc. Natl. 
Acad. Sci. 87:9568-9572, 1990). Another method for introducing DNA into cells is to 
couple the DNA to chemically modified proteins. 

It has also been shown that adenovirus proteins are capable of destabilizing 
endosomes and enhancing the uptake of DNA into cells. The admixture of adenovirus to 
solutions containing DNA complexes, or the binding of DNA to polylysine covalently 
attached to adenovirus using protein crosslinking agents substantially improves the uptake 
and expression of the recombinant gene (Curiel et al., Am. J. Respir. Cell. Mol. Biol., 
6:247-52, 1992). 

As used herein "gene transfer" means the process of introducing a foreign nucleic 
acid molecule into a cell. Gene transfer is commonly performed to enable the expression 
of a particular product encoded by the gene. The product may include a protein, 
polypeptide, anti-sense DNA or RNA, or enzymatically active RNA. Gene transfer can be 
performed in cultured cells or by direct administration into animals. Generally gene 
transfer involves the process of nucleic acid contact with a target cell by non-specific or 
receptor mediated interactions, uptake of nucleic acid into the cell through the membrane 
or by endocytosis, and release of nucleic acid into the cytoplasm from the plasma 
membrane or endosome. Expression may require, in addition, movement of the nucleic 
acid into the nucleus of the cell and binding to appropriate nuclear factors for 
transcription. 

As used herein "gene therapy" is a form of gene transfer and is included within the 
definition of gene transfer as used herein and specifically refers to gene transfer to express 
a therapeutic product from a cell in vivo or in vitro. Gene transfer can be performed ex 
vivo on cells which are then transplanted into a patient, or can be performed by direct 
administration of the nucleic acid or nucleic acid-protein complex into the patient. 

In another preferred embodiment, a vector having nucleic acid sequences encoding 
a protein kinase polypeptide of the invention is provided in which the nucleic acid 
sequence is expressed only in specific tissue. Methods of achieving tissue-specific gene 
expression are set forth in International Publication No. WO 93/09236, filed November 3, 
1992 and published May 13, 1993. 

In all of the preceding vectors set forth above, a further aspect of the invention is 
that the nucleic acid sequence contained in the vector may include additions, deletions or 
modifications to some or all of the sequence of the nucleic acid, as defined above. 

In another preferred embodiment, a method of gene replacement is set forth. 
"Gene replacement" as used herein means supplying a nucleic acid sequence which is 
capable of being expressed in vivo in an animal and thereby providing or augmenting the 
function of an endogenous gene that is missing or defective in the animal. 

X. Administration of Substances 

Methods of determining the dosages of compounds to be administered to a patient 
and modes of administering compounds to an organism are disclosed in U.S. Application 
Serial No. 08/702,282, filed August 23, 1996 and International patent publication number 
WO 96/22976, published August 1, 1996, both of which are incorporated herein by 
reference in their entirety, including any drawings, figures, or tables. Those skilled in the 
art will appreciate that such descriptions are applicable to the present invention and can be 
easily adapted to it. 

The proper dosage depends on various factors such as the type of disease being 
treated, the particular composition being used, and the size and physiological condition of 
the patient. Therapeutically effective doses for the compounds described herein can be 
estimated initially from cell culture and animal models. For example, a dose can be 
formulated in animal models to achieve a circulating concentration range that initially 
takes into account the IC50 as determined in cell culture assays. The animal model data 
can be used to more accurately determine useful doses in humans. 

Plasma half-life and biodistribution of the drug and metabolites in the plasma, 
tumors, and major organs can also be determined to facilitate the selection of drugs 
most appropriate to inhibit a disorder. Such measurements can be carried out. For 
example, HPLC analysis can be performed on the plasma of animals treated with the drug 
and the location of radiolabeled compounds can be determined using detection methods 
such as X-ray, CAT scan, and MRI. Compounds that show potent inhibitory activity in 
the screening assays, but have poor pharmacokinetic characteristics, can be optimized by 
altering the chemical structure and retesting. In this regard, compounds displaying good 
pharmacokinetic characteristics can be used as a model. 

Toxicity studies can also be carried out by measuring the blood cell composition. 
For example, toxicity studies can be carried out in a suitable animal model as follows: 1) 
the compound is administered to mice (an untreated control mouse should also be used); 
2) blood samples are periodically obtained via the tail vein from one mouse in each 
treatment group; and 3) the samples are analyzed for red and white blood cell counts, 
blood cell composition, and the percent of lymphocytes versus polymorphonuclear cells. 
A comparison of results for each dosing regime with the controls indicates if toxicity is 
present. 

At the termination of each toxicity study, further studies can be carried out by 
sacrificing the animals (preferably, in accordance with the American Veterinary Medical 
Association guidelines, Report of the American Veterinary Medical Assoc. Panel on 
Euthanasia, Journal of American Veterinary Medical Assoc., 202:229-249, 1993). 
Representative animals from each treatment group can then be examined by gross 
necropsy for immediate evidence of metastasis, unusual illness, or toxicity. Gross 
abnormalities in tissue are noted, and tissues are examined histologically. Compounds 
causing a reduction in body weight or blood components are less preferred, as are 
compounds having an adverse effect on major organs. In general, the greater the adverse 
effect the less preferred the compound. 

For the treatment of cancers the expected daily dose of a hydrophobic 
pharmaceutical agent is between 1 to 500 mg/day, preferably 1 to 250 mg/day, and most 
preferably 1 to 50 mg/day. Drugs can be delivered less frequently provided plasma levels 
of the active moiety are sufficient to maintain therapeutic effectiveness. 

Plasma levels should reflect the potency of the drug. Generally, the more potent the 
compound the lower the plasma levels necessary to achieve efficacy. 

EXAMPLES 

The examples below are not limiting and are merely representative of various 
aspects and features of the present invention. The examples below demonstrate the 
isolation and characterization of the protein kinases of the invention. 

EXAMPLE 1: Isolation of cDNA Clones Encoding Novel Mammalian Protein Kinases 

Materials and Methods 

Identification from cDNA databases and isolation of clones encoding novel protein kinases 

Novel kinases were identified from the public EST databases using a Hidden 
Markov model, abbreviated HMM (Krogh, A., Brown, M., Mian, I. S., Sjolander, K., and 
Haussler, D. 1994. Hidden Markov models in computational biology: Applications to 
protein modeling. J. Mol. Biol. 235:1501-1531). The model was built with 70 
mammalian and yeast kinase catalytic domain sequences. These sequences were chosen 
from a comprehensive collection of kinases such that no two sequences had more than 
50% sequence identity. ESTs were translated in six open reading frames and were 
searched against the model. ESTs that had a score of at least 10 against the HMM were 
then masked for repetitive sequences and vectors and were clustered using MSA. The 
resulting contigs were searched against known kinases to identify EST clones that encode 
novel kinases. 
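
The six-frame translation step that precedes the HMM search can be sketched as follows. This is a minimal illustration only: the function names are hypothetical, the standard genetic code is assumed, and the sketch does not reproduce the HMM scoring, masking, or clustering software actually used.

```python
from itertools import product

# Standard genetic code, codons enumerated with base order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def translate(seq):
    """Translate complete codons only; unknown codons (e.g. with N) become X."""
    seq = seq.upper()
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X")
                   for i in range(0, len(seq) - 2, 3))


def six_frame_translations(seq):
    """Return a dict of the three forward (+1..+3) and three reverse (-1..-3) frames."""
    frames = {}
    for strand, s in (("+", seq.upper()), ("-", reverse_complement(seq))):
        for offset in range(3):
            frames[f"{strand}{offset + 1}"] = translate(s[offset:])
    return frames


if __name__ == "__main__":
    for name, protein in six_frame_translations("ATGGCCTAA").items():
        print(name, protein)
```

Each of the six translated frames would then be scored against the kinase catalytic-domain model, and ESTs with any frame reaching the score threshold would be retained.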

Approximately 40% of the ESTs encoding potentially novel kinases did not 
correspond to the correct EST upon sequence analysis. Most of these discrepancies were 
resolved by ordering additional clones; however, 14 remained unavailable. These 14 ESTs 
were amplified from a variety of single-stranded cDNA sources with primers derived from 
the corresponding EST entry as shown in Table 5. The PCR product was subcloned into a 
Bluescript vector, digested to confirm the presence of a correct size insert and sequenced. 
Full sequencing of EST clones and PCR products was carried out using a cycle sequencing 
Big-dye kit with AmpliTaq DNA Polymerase, FS (ABI, Foster City, CA). Sequencing reaction 
products were run on an ABI Prism 377 DNA Sequencer. 

Table 5: Primers used to clone PCR products corresponding to novel kinases 

ID#               Primer sequence*            Primer sequence*              Sp   na            aa
[illegible]       GAGATCGRNTTVAARGARTTYGA     TGTCACNCCNAGNSWCCANAYRTT      [illegible]
[illegible]       GCTGCTGGACAGTGACTTGTA1TJ    uAAAuCAAAuCCTT'CACACCTT       M    [illegible]   [illegible]
5K69_ 3 /_2_h     CTCTCACCTCAGGAACTGG         GCTTGCGGATCTTCTCA             H    46            166
SGK309_h          GACATCCTGCCGGCCAACTACG      [illegible]...CATCACTA        M    67            228
5R72_6_2_h        TGCGCGACACCATTGACCAG        CTCAGGGCTTACATACAGAG          H    45            165
5R72_8_2_h        AAAGGAGAACTACATTTTGAAAAT    CJ-1CATCATCTCTAATACATTGGTTGG  H    41            161
Z36720            CAAATTAAGATCA7TGACTTTGGG    GGAAACAAAGTCCTTGGCCTC         H    115           234
AL031652 - Pak6   GTGGACATCTGGTCCCTCG         GTAGGTCCTTCACTCTTGGAG



* N = A, C, G or T 
R = A or G 
Y = C or T 
S = C or G 
W = A or T 
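
The degenerate-base codes in the legend above can be handled programmatically. The following is an illustrative sketch, not part of the original protocol: it covers only the five codes listed in the legend, the helper names are hypothetical, and the example primer is made up for demonstration.

```python
import re

# Only the degenerate codes given in the Table 5 legend, plus the four plain bases.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "N": "ACGT", "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
}


def degeneracy(primer):
    """Number of distinct plain sequences a degenerate primer represents."""
    count = 1
    for base in primer.upper():
        count *= len(IUPAC[base])  # raises KeyError for codes outside the legend
    return count


def primer_regex(primer):
    """Compile a regular expression matching every sequence the primer encodes."""
    parts = []
    for base in primer.upper():
        options = IUPAC[base]
        parts.append(options if len(options) == 1 else f"[{options}]")
    return re.compile("".join(parts))


def primer_matches(primer, target):
    """True if target is one of the sequences encoded by the degenerate primer."""
    return primer_regex(primer).fullmatch(target.upper()) is not None


if __name__ == "__main__":
    example = "GGNTAYSW"  # hypothetical primer, not taken from Table 5
    print(degeneracy(example))
    print(primer_matches(example, "GGCTACGA"))
```

The degeneracy count gives a quick sense of how many distinct oligonucleotides a degenerate primer pool contains.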



Full-length sequence extension of protein kinases using cDNA and genomic databases 

Extension of partial cDNA sequences to encompass the full-length open-reading 
frame was carried out by iterative blastn searching of the cDNA databases listed in Table 
6. All blastn searches were conducted using a blosum62 matrix, a penalty for a nucleotide 
mismatch of -3 and reward for a nucleotide match of 1. The gapped blast algorithm is 
described in: (Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui 
Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and 
PSI-BLAST: a new generation of protein database search programs". Nucleic Acids Res 
25:3389-3402). 
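
To illustrate what the stated nucleotide reward and penalty imply, the sketch below scores an ungapped alignment of two equal-length sequences with a match reward of 1 and a mismatch penalty of -3. It is a simplified illustration with a hypothetical function name; it is not the BLAST algorithm itself, which also handles seeding, extension, and gaps.

```python
def ungapped_score(a, b, match=1, mismatch=-3):
    """Score two equal-length nucleotide sequences position by position."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(match if x == y else mismatch
               for x, y in zip(a.upper(), b.upper()))


if __name__ == "__main__":
    print(ungapped_score("ACGTACGT", "ACGTACGA"))
```

With these values each mismatch cancels three matches, so an ungapped segment scores above zero only when more than 75% of its positions are identical.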



Table 6. Databases used for cDNA-based sequence extensions 

Database              Database Date
LifeGold templates    Feb 2000
LifeGold compseqs     Feb 2000
LifeGold compseqs     Mar 2000
LifeGold compseqs     Apr 2000
LifeGold fl           Feb 2000
LifeGold flft         Apr 2000
NCBI human Ests       May 2000
NCBI murine Ests      May 2000
NCBI nonredundant     May 2000



Extension of partial cDNA sequences to encompass the full-length open-reading 
frame was also carried out by iterative searches of genomic databases. Three methods 
were used. The first method made use of the Smith-Waterman algorithm to carry out 
protein-protein searches of the closest homologue or orthologue to the partial kinase. The 
target databases consisted of Genescan and open-reading frame (ORF) predictions of all 
human genomic sequence derived from the human genome project (HGP) as well as from 
Celera. The complete set of genomic databases searched is shown in Table 7 below. 
Genomic sequences encoding potential extensions were further assessed by blastp analysis 
against the NCBI nonredundant database to confirm the novelty of the hit. The extending 
genomic sequences were incorporated into the cDNA sequence after removal of potential 
introns using the Seqman program from DNAStar. The default parameters used for 
Smith-Waterman searches were as follows. Matrix: blosum 62; gap-opening penalty: 12; gap 
extension penalty: 2. Genescan predictions were made using the Genescan program as 
detailed in (Chris Burge and Sam Karlin "Prediction of Complete Gene Structures in 
Human Genomic DNA", JMB (1997) 268(1):78-94). ORF predictions from genomic 
DNA were made using a standard 6-frame translation. 

The second method for genomic sequence-based extensions made use of tblastn 
searches of the homologue or orthologue to the partial kinase against the cDNA databases 
listed in Table 7. The recognition of significant hits in these databases made it possible to 
identify bridging partial cDNA clones. The iterative application of the two methods made 
possible the assemblage of the virtual full-length sequence for a large number of the 
kinases presented in this application. All tblastn searches were conducted using a 
blosum62 matrix, a penalty for a nucleotide mismatch of -3 and reward for a nucleotide 
match of 1. 
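
The Smith-Waterman parameters given for the first method (gap-opening penalty 12, gap-extension penalty 2) can be illustrated with a small local-alignment scorer using affine gaps. This is a sketch under stated assumptions: a gap of length k is assumed to cost 12 + 2(k - 1), which is one common convention and may differ from the software actually used; `toy_score` is a hypothetical stand-in for the blosum 62 matrix, which is not reproduced here; and only the best score is returned, not the alignment.

```python
def toy_score(x, y):
    """Hypothetical substitution score standing in for a real matrix such as BLOSUM62."""
    return 5 if x == y else -4


def smith_waterman_score(a, b, score=toy_score, gap_open=12, gap_extend=2):
    """Best local alignment score with affine gap penalties (Gotoh recurrences)."""
    n, m = len(a), len(b)
    neg_inf = float("-inf")
    # H: best score ending at (i, j); E/F: best score ending in a gap in a / in b.
    H = [[0] * (m + 1) for _ in range(n + 1)]
    E = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    F = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    best = 0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            # Open a new gap (cost gap_open) or extend an existing one (cost gap_extend).
            E[i][j] = max(H[i][j - 1] - gap_open, E[i][j - 1] - gap_extend)
            F[i][j] = max(H[i - 1][j] - gap_open, F[i - 1][j] - gap_extend)
            diagonal = H[i - 1][j - 1] + score(a[i - 1], b[j - 1])
            H[i][j] = max(0, diagonal, E[i][j], F[i][j])
            best = max(best, H[i][j])
    return best


if __name__ == "__main__":
    print(smith_waterman_score("ACDEFGHIKLMN", "ACDEFHIKLMN"))
```

In the example, the two flanking matched segments are long enough that bridging them with a single-residue gap scores higher than either segment alone.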

The last method for defining cDNA extensions from genomic sequence used 
iterative searches of genomic databases through the Genescan program to predict exon 
splicing and the Genewise program (http://www.sanger.ac.uk/Software/Wise2/) to predict 
potential ORFs based on homology to the closest orthologue/homologue. 

Table 7. Databases used for genomic-based sequence extensions 

Database            Number of entries    Database Date
Celera v. 1-5       5,306,158            Jan 19/00
Celera v. 6-10      4,209,980            Mar 24/00
Celera v. 11-14     7,222,425            Apr 24/00
Celera v. 15        243,044              May 14/00
HGP all Genescan    25,885               Apr 04/00
HGP; Phase 0        4,944                May 04/00
HGP; Phase 1        28,478               May 05/00
HGP; Phase 2        1,508                May 04/00
HGP; Phase 3        9,971                May 05/00



Virtual Extensions 

Human AA826850 (SEQ ID NO: 3, SEQ ID NO: 124) 

Blastn analysis of the partial AA826850 sequence revealed an extension to 
encompass the complete ORF in the Incyte EST 238299.1. A frame-shift correction at 
position 595 of this EST (marked by X in NA sequence) generated an uninterrupted ORF. 

Human AA960957 (SEQ ID NO: 4, SEQ ID NO: 125) 

Since the initial filing of this application, the partial AA960957 sequence appeared 
in the public database as the full-length gene for a protein kinase encoded by a gene that 
maps adjacent to the EVC (AJ250839) (Ellis-van Creveld syndrome and Weyers acrodental 
dysostosis) gene from 4p16.1. 

Human 5R79-46-1_h (SEQ ID NO: 5, SEQ ID NO:126) 
Blastn analysis of the partial 5R79-46-1 sequence revealed an extension to 
encompass the complete ORF in the Incyte EST 463894.6. Since the initial filing of this 
application, the full-length virtual 5R79-46-1 appeared in the public database as the full- 
length gene for the TANK-binding kinase (TBK1) (Pomerantz,J.L. and Baltimore, D. 
(1999) EMBO J. 18 (23), 6694-6704). TBK1 participates in NF-kB activation through the 
formation of a signaling complex with TRAF2 and TANK. 

Human AA305176 (SEQ ID NO: 6, SEQ ID NO: 127) 
Blastn analysis of the partial AA305176 sequence revealed an extension to 
encompass the complete ORF in the Incyte EST 220937.1. 

Human AA256100 (SEQ ID NO: 8, SEQ ID NO:129) 
Blastn analysis of the partial AA256100 sequence revealed an extension to 
encompass the complete ORF through the assembly of three partial clones: Incyte EST 
480815.6, KIAA0965 (BAA76809) and AA256100. 
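
The assembly of overlapping partial clones into a longer virtual sequence, as described in several of these entries, can be sketched in simplified form as a suffix-prefix merge. This is an illustrative sketch only: it requires an exact overlap, the function name and the `min_overlap` default are hypothetical, and real contig assembly (for example with Seqman) tolerates mismatches and handles orientation.

```python
def merge_by_overlap(left, right, min_overlap=10):
    """Join two sequences on their longest exact suffix/prefix overlap.

    Returns the merged sequence, or None if no overlap of at least
    min_overlap bases exists.
    """
    if min_overlap < 1:
        raise ValueError("min_overlap must be at least 1")
    longest_possible = min(len(left), len(right))
    # Try the longest candidate overlap first so the merge is as compact as possible.
    for k in range(longest_possible, min_overlap - 1, -1):
        if left[-k:] == right[:k]:
            return left + right[k:]
    return None


if __name__ == "__main__":
    print(merge_by_overlap("ACGTACGTTTGACC", "TTGACCGGATAA", min_overlap=4))
```

Applying such a merge repeatedly, each time searching the databases with the newly extended sequence, mirrors the iterative extension strategy described above.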

Human AA210825 (SEQ ID NO: 9, SEQ ID NO: 130) 
Blastn analysis of the partial AA210825 sequence revealed an extension to 
encompass the nearly complete ORF through the assembly of three partial clones: Incyte 
EST 014721.7, and the NCBI ESTs AW01 158 and AA210825. An insertion of two "N's" 
at positions 1915 and 1916 generated an uninterrupted ORF. Blastx analysis indicated the 
possibility of a start Met in the range of 400-450 nucleotides (i.e., compared to the closest 
homolog, human PKCmu (CAA53384.1)). However, no Met was found in this region; 
rather, the ORF ends in an in-frame stop preceded by the sequence 
"RGLLAPGDPPCPPPNPAPATPPSSRLPTELFSNFCDS". It is possible that part of the 
sequence covered by nucleotide positions 1-400 derived from AW01 158 comes from an 
intron, explaining the absence of a start Met. 

Human AA127299 (SEQ ID NO: 10, SEQ ID NO: 131) 

No entries in the database extended this sequence. The 1684 bp insert of this EST 
contains a 1369 bp intron at the 3' end. Blastx and SW analysis of the 315 bp coding 
region revealed homology to the extracatalytic C2 domain of PKC. This EST may or may 
not encode a kinase. 

Human AA316804 (SEQ ID NO:11, SEQ ID NO: 132) 

Since the initial filing of this application, the partial AA316804 sequence appeared 
in the public database as the full-length gene for the PKC family protein kinase EPK2 or 
PKCnu (AB015982). 

Human H19102 (SEQ ID NO:14, SEQ ID NO:135) 

Genewise and Genescan analyses of the partial H19102 sequence revealed an 
extension from the HGP phase 3 contig 3810672 to encompass the complete catalytic 
domain of this EST. Blastn analysis against the non-redundant database revealed that this 
gene is found in the cosmid AC005726 from chromosome 17. H19102 may encode a dual 
catalytic kinase given the homology to S6 kinase. Analysis of genomic sequence upstream 
of the 5' end of H19102 revealed a non-kinase gene oriented in the same polarity as 
H19102, suggestive of the start Met for H19102 being close to the 5' end of the H19102 
sequence. From this analysis it is deduced that the second catalytic domain of H19102, if 
present, is most likely located within the 47,334-185,215 bp region of the genomic 
sequence of AC005726. 

Human AA476563 (SEQ ID NO:15, SEQ ID NO:136) 

Since the initial filing of this application, the partial AA476563 sequence appeared 
in the public database as the full-length gene for the protein kinase RPS6KC1 
(NM_012424) (Zhang, H. et al. Genomics (1999) 61, 314-318), which is an S6 kinase 
mapping to 12q12-q13.1. 

Human AA626690 (SEQ ID NO:16, SEQ ID NO:137) 

Since the initial filing of this application, the partial AA626690 sequence appeared 
in the public database as the full-length gene for the protein kinase RPS6KA6 (AF184965) 
(Yntema, H.G. et al. (1999) Genomics 62, 332-343), an S6 kinase commonly deleted in 
patients with complex X-linked (Xq21.1) mental retardation. 

Human AI215680 (SEQ ID NO:17, SEQ ID NO:138) 

Since the initial filing of this application, the partial AI215680 sequence appeared 
in the public database as the full-length gene encoding a hypothetical protein (AAJ3301 82) 
from the locus AC006530.4 from chromosome 14. 

Human AA887783 (SEQ ID NO:21, SEQ ID NO:142) 

Blastn analysis of the partial AA887783 sequence revealed an extension to 
encompass the nearly complete ORF through the assembly of three partial clones: Incyte 
415390R6 and the NCBI ESTs AA887783 and N94726. Since the initial filing of this 
application, the nearly full-length virtual AA887783 sequence appeared in the public 
database as the full-length gene encoding SGK3 (AF169035), a serum- and 
glucocorticoid-induced protein kinase (Kobayashi, T. et al. (1999) Biochemical J. 344, 189- 
197). 

Human R47805 (SEQ ID NO:22, SEQ ID NO: 143) 

A cDNA clone encoding the full-length ORF of R47805 was isolated using 
R47805 as a screening probe. A full-length form for R47805 has also appeared in the 
public database as PTK9L (NM_007284), an A6-related protein kinase. 

Human H60215 (SEQ ID NO:23, SEQ ID NO: 144) 

Blastn analysis of the partial H60215 sequence revealed an extension to encompass 
the complete ORF in the public EST AI275726. This was confirmed through the full insert 
sequencing of this EST (2,310 bp) which corresponds to the sequence under SEQ ID 
NO:144. 

A different stop codon was predicted for AI275726 compared to H60215 due to a 
single nucleotide insertion at position 1586 in AI275726. Evidence for the extra nucleotide 
comes from EST AI191922. 

SGK324_h orthologue of W30246_m (SEQ ID NO:24, SEQ ID NO:145) 

Blastn, blastx and Smith-Waterman analyses of genomic databases revealed an 
extension to encompass the complete ORF corresponding to the human orthologue of 
murine W30246. Exons predicted from the following sequences were used for contig 
construction: Celera 17000189645083, 17000057549105 and 11000501939981; 
Incyte 142404.1, HGP_7249119, Incyte 7196489H1, Celera 11000501939981, 
17000028165594; Incyte 7249119_3, Celera 17000035772368, 11000502081575 and 
17000140274329. The latter Celera sequence provides the N-terminus. 

Human AA383293 (SEQ ID NO:26, SEQ ID NO:147) 

Blastn, blastx and Smith-Waterman analyses of genomic databases revealed an 
extension to encompass the complete ORF corresponding to AA383293. Exons predicted 
from the following sequences were used for contig construction (numbers in parentheses 
refer to the aa sequence of the closest homolog (RU2S, NP_057440) used for the Smith- 
Waterman query): N-term from Incyte 6010175_2 (14-97), Incyte 6981981 (134-184), 
7596749 (186-232), Celera 17000020789545 (243-301), CAB75619.1 (310-341)--(56-145 
DCX homology) 6010175_2, Celera 17000030058129 (241-262 DCX homology). 

Human AA021445 (SEQ ID NO:32, SEQ ID NO:152) 

Blastn analysis revealed an extension to encompass the nearly complete ORF 
corresponding to AA021445. Contig reconstruction was as follows: nucleotides 1-802 
from KIAA0999 (AB023216); nucleotides 803-4321 from full-insert sequence of 
AA021445. A pairwise alignment between AA021445 and KIAA0999 revealed three 
inserts in the extracatalytic C-terminus of 48, 48 and 161 amino acids. In addition, both 
AA021445 and KIAA0999 have 15 copies of a CAG repeat. Trinucleotide repeats are 
often found in genes that are linked to neurodegenerative diseases. 
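
Counting uninterrupted copies of a trinucleotide repeat such as CAG in a nucleotide sequence can be sketched as follows. This is an illustrative helper with a hypothetical name; it counts only perfect, contiguous repeats and is not taken from the original analysis.

```python
import re


def longest_cag_run(seq):
    """Return the largest number of consecutive, uninterrupted CAG copies in seq."""
    runs = re.findall(r"(?:CAG)+", seq.upper())
    return max((len(run) // 3 for run in runs), default=0)


if __name__ == "__main__":
    # Made-up test sequence: a 15-copy run followed by a shorter 3-copy run.
    example = "AT" + "CAG" * 15 + "TT" + "CAG" * 3
    print(longest_cag_run(example))
```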

Human 2R22-55-1 (SEQ ID NO:33, SEQ ID NO:153) 
Blastn analysis revealed an extension in the Incyte EST clone 321074.1 to 
encompass the complete ORF corresponding to 2R22-55-1. 

Human orthologue of AA544838_m (SEQ ID NO:36, SEQ ID NO:156) 
tBlastn analysis identified the partial human KIAA0135 (U79240) clone as the 
human orthologue of murine AA544838. Blastn revealed an extension of KIAA0135_h 
(U79240) to encompass the complete ORF. The full ORF was reconstructed from 
Incyte 406786.5, KFZp430051 and KIAA0135 (U79240). 

Human orthologue of AI785735_m (SEQ ID NO:38, SEQ ID NO:158) 
tBlastn analysis identified the partial human KIAA0781 (AB018324) clone as the 
human orthologue of murine AI785735. Blastn revealed an extension of KIAA0135_h 
(U79240) to encompass the complete ORF. The full ORF was reconstructed from Incyte 
986123.37 KIAA0781 (AB018324). 

Human AA207220 (SEQ ID NO:39, SEQ ID NO:159) 

Blastn analysis revealed an extension to encompass the nearly complete ORF 
corresponding to AA021445. The full ORF was reconstructed from Incyte 402740.1 and 
AA207220. Frame corrections: deletion of 441 and 595 over the Incyte 402740.1 sequence 
based on blastx to keep the frame open; two N insertions at 940 and 941 over AA207220 
to keep the frame open. 

Human AA426580 (SEQ ID NO:40, SEQ ID NO:160) 

Since the initial filing of this application, the partial AA426580 sequence appeared 
in the public database as the full-length gene encoding MAK-V (AJ271"22) from 
chromosome 21q22.1. 

Human 5R79-54-1 (SEQ ID NO: 41, SEQ ID NO: 161) 

Genewise and Genescan analyses of the partial 5R79-54-1 sequence revealed an 
extension from genomic sequence to encode the full ORF for 5R79-54-1. 

Human orthologue of AA542015_m (SEQ ID NO:42, SEQ ID NO:162) 

tBlastn analysis identified KIAA1297 (AB037718). Blastn extended the 
KIAA1297 sequence to provide the C-terminus through the Incyte 224074.1 EST. The 
partial ORF consists of a dual catalytic domain flanked by 6 Ig domains and 2 fibronectin 
repeats. Based on homology to the bt Drosophila protein (AAF59316.1), the human form 
of AA542015 is expected to be missing 16 Ig domains. 

Human R19772 (SEQ ID NO:44, SEQ ID NO: 164) 

The full-length ORF for R19772 was isolated by screening a cDNA library using a 
probe derived from R19772. Since the initial filing of this application, the R19772 
sequence appeared in the public database as the full-length gene encoding Trio (Duet) 
(AB011422). cDNA library screening revealed multiple isoforms for this gene, which are 
summarized in Table 8 below. 



Table 8. Isoforms for R19772 

Kestrl Name    Kestrl AA Acc #    Isoform type    Source        Description*
Trad (Duet)    R19772                                           Deletion of K at 124
                                                                Deletion of Q at 616
                                                                Substitution of E for G at
                                                                Deletion of 32 aa (160-191)
                                  D               Lung tumor    Deletion of Q at 616
                                                                Deletion of 32 aa (160-191)
                                                                Deletion of 32 aa (360-191)

* Reference amino acid positions are with respect to the sequence of Trad (AB011422) 



Human AA435956 (SEQ ID NO:48, SEQ ID NO:168)

Blastn analysis revealed an extension to encompass the nearly complete catalytic
region of AA435956. 5' end sequence extension was provided by genomic locus
AC007242.3_h (range 44880-43801). Based on blastx analysis, the extended sequence
is full-length at the C-terminus.

Human AA397553 (SEQ ID NO:51, SEQ ID NO:171)

Since the initial filing of this application, the partial AA397553 sequence appeared
in the public database as the full-length gene encoding CRK7 (AF227198), a novel
CDC2-related protein kinase that colocalizes with interchromatin granule clusters.

Human AA789239 (SEQ ID NO:52, SEQ ID NO:172)

Since the initial filing of this application, the partial AA789239 sequence appeared
in the public database as the full-length gene encoding NKIAMRE (AF130372), a novel
kinase deleted in human leukemia.

Human AA631990 (SEQ ID NO:55, SEQ ID NO:175)

Blastn analysis revealed an extension to encompass the full-length ORF for
AA631990. The full ORF was reconstructed from 253847.5, AA631990 and
AA207220. Frame corrections: delete 1 C at 1380, delete 2 N's at 2033/2034.

Human AA557536 (SEQ ID NO:56, SEQ ID NO:176)

Blastn analysis revealed an extension to encompass the full-length ORF for
AA557536. The full ORF was reconstructed from AA557536, Celera 11000504061899
and the Incyte 097089.1 EST. An 85 bp intron was removed from AA557536.

Human N34132 (SEQ ID NO:63, SEQ ID NO:183)

Full sequencing of EST N34132 (1.3 kb) confirmed that this cDNA encodes a
novel NEK-subfamily kinase. Blast analysis against the EST database showed that four
EST sequences (AA283140, AA2S3140, AA282911 and N53011) extended the sequence
of N34132 at the 3' end to form a 2.31 kb contig. Blast analysis of the new contig against
the nonredundant public database showed that the N34132 extended contig overlapped
(100% identity) over 228 bp at its 3' end with human KIAA0344 (AB002342), a 5,787 bp
cDNA encoding a 1246 aa polypeptide. The 5' 790 bp of the KIAA0344 cDNA (encoding
the 58 N-terminal residues of the protein sequence) were found to be divergent with
respect to the extended 2.31 kb N34132 contig. Two lines of evidence indicate that the
extended N34132 contig (2.31 kb) and KIAA0344 (AB002342) belong to the same gene.
First, blast analysis of the nucleotide sequences for N34132 and KIAA0344 against the
NRN database confirmed that these cDNAs are transcribed from the same genomic locus,
defined by two overlapping BACs (AC004765 and AC004803) from chromosome 12p13.3.
Second, full sequence determination of a PCR fragment amplified from single-stranded
cDNA confirmed the junction between the extended N34132 contig and KIAA0344_h
(AB002342). The 462 bp PCR product was amplified with primers
CTCCTCAACAGACAGTGCAG (5' primer) and GACATTCTACTACTCGGTCTC (3' primer),
designed from the N34132 extended contig and KIAA0344 sequences,
respectively. The region of N34132 containing the start Met was isolated by PCR from a
testis cDNA library (Clontech).

Human 5R69-17-2 (SEQ ID NO:67, SEQ ID NO: 187) 

The full-length ORF for 5R69-17-2 was isolated by screening a cDNA library
using a probe derived from 5R69-17-2.

Human H85811 (SEQ ID NO:68, SEQ ID NO:188)

Tblastn, Smith-Waterman and blastn analyses using cDNA databases revealed an
extension to encompass the full-length ORF for H85811. The full ORF was reconstructed
from Incyte ESTs 202971.8, 034583.3 and 034583.1 and public ESTs H85811 and
AI570599.

Human R43524 (SEQ ID NO:73, SEQ ID NO:192)

Blastn analysis revealed an extension to encompass the complete catalytic region
and the C-terminus of R43524. Since the initial filing of this application, the partial
R43524 sequence appeared in the public database as the full-length gene encoding the
heme-regulated initiation factor 2-alpha kinase (HRI) (AF181071).

Human AA088547 (SEQ ID NO:78, SEQ ID NO:197)

Genewise and Genescan analyses of genomic databases revealed an extension to
encompass the complete ORF for AA088547.

Human orthologue of AA139478_m (SEQ ID NO:80, SEQ ID NO:199)

Tblastn identified the Incyte 211475.1 as the potential full-length human
orthologue of murine AA139478.

Human AA232253 (SEQ ID NO:82, SEQ ID NO:201) 

The full-length ORF for AA232253 was isolated by screening a cDNA library
using a probe derived from AA232253. Since the initial filing of this application, the
AA232253 sequence appeared in the public database as the full-length gene encoding SLK
(AB011422). SLK is a stress-regulated mixed lineage kinase-like protein implicated in
activation of Rac and induction of apoptosis. cDNA library screening revealed multiple
isoforms for this gene, which are summarized in the Table below.

Table 9. Isoforms for AA232253

Kestrl Name   Kestrl AA Acc#   Isoform type   Description*
MLK4          AA232253         MLK4           Substitution of C for W at 346
                               MLK4B          Different C-term (332-800); seq in MLK4B is as shown in *

* C-terminus specific to MLK4B
LPLAJUIMSEESYFESKTEESNSAEMSCQITATSNGEGHGMNPSLQAMMLMGFGDI 
FSMNKAGAVMHSGMQINMQAXQNSS 

KTTSKJIRGKKVNMALGFSDFDLSEGDDDDDDDGEEEDNDMDNSE 

Human H97685 (SEQ ID NO:84, SEQ ID NO:203)

Blastn analysis revealed an extension to encompass the full-length ORF for
H97685. The full ORF was reconstructed from Incyte 474824.1 and the public ESTs
H97685 and M62021.

Human AI052250 (SEQ ID NO:87, SEQ ID NO:206)

Blastn analysis revealed an extension to encompass the full-length ORF for
AI052250. The full ORF was reconstructed from Incyte 396S6S.1, the public partial
cDNA FLJ10074 (minus intron) and the public ESTs AI052250,
H97685, AI499220 and M62021.

Human AA278842 (SEQ ID NO:88, SEQ ID NO:206)

A nearly full-length cDNA (FL4F12) for AA278842 was isolated by screening a 
cDNA library using a probe derived from AA278842. A full-length virtual ORF was 
generated using FL4F12 and AA278842. 

Human AA599286 (SEQ ID NO:89, SEQ ID NO:208)

Since the initial filing of this application, the partial AA599286 sequence appeared
in the public database as a full-length ORF (AK000342).

Human AA425725 (SEQ ID NO:90, SEQ ID NO:209)

Since the initial filing of this application, the partial AA425725 sequence appeared
in the public database as MSSK1, a serine kinase gene located on human chromosome
Xq28.

Human SGK022 orthologue of AA060026_m (SEQ ID NO:91, SEQ ID NO:210)

Tblastn, Smith-Waterman and blastn analyses of cDNA and genomic databases
revealed a potential human orthologue for murine AA060026. The full-length
ORF for SGK022 was reconstructed from genomic locus AC022307.

Human AA399669 (SEQ ID NO:93, SEQ ID NO:212)

Blastn analysis revealed an extension to encompass the full-length ORF for
AA399669. The full ORF was reconstructed as follows: sequence 1-1007 from
AL136295.2; sequence 1008-2319 from AA399669 and Incyte 428177.1.

Human AA883975 (SEQ ID NO:95, SEQ ID NO:214)

Genescan and Genewise analyses of the genomic databases revealed an extension
for AA883975 to encompass the full-length ORF.

Human AA905446 (SEQ ID NO:96, SEQ ID NO:215)

Tblastn, Smith-Waterman and blastn analyses of cDNA and genomic databases
revealed an extension for AA905446 to encompass the full-length ORF. For the
Smith-Waterman analysis, murine STK22 (NP_033462) was used as the closest
orthologue. Contig formation: range 162133-163687 from HGP_h 692 1 333_9; removed
intron (146-893) predicted from blastx analysis.

Human H29974 (SEQ ID NO:97, SEQ ID NO:216)

Blastn analysis revealed an extension to encompass a complete catalytic ORF for
AA399669. The nearly full-length ORF was reconstructed using Incyte 2 \ 3829. 1 and
H29974.

Human AA215311 (SEQ ID NO:99, SEQ ID NO:218)

Blastn analysis revealed an extension to encompass the full-length ORF for
AA215311. The full ORF was reconstructed from Incyte 067584.1, 022456.1, AA215311
and the reverse complement of CPG_043208.

Human AA018361 (SEQ ID NO: 100, SEQ ID NO:219) 

The full-length ORF for AA018361 was isolated by screening a cDNA library
using a probe derived from AA018361. This yielded clone Sug4-30. Clone Sug4-30, like
multiple independent cDNA clones, contained a 181 bp intron. The existence of intron-less
RNAs was confirmed by a PCR reaction that generated a product that, upon sequence
analysis, skipped the intron region. The full-length virtual ORF for AA018361 was
generated through a contig between AL117482 (seq 1-367) and the sequence for clone
Sug4-30.

Human orthologue of AA396601_m (SEQ ID NO:106, SEQ ID NO:225)

tBlastn and Smith-Waterman analyses of genomic sequence revealed an extension
to encompass the full catalytic region for the human orthologue of AA396601. The ORF
was reconstructed from Incyte 018653.9 (7261449H1, 6891740J1) and genomic sequence
CPGJMOOIO.

Human orthologue of AA671275_m (SEQ ID NO:108, SEQ ID NO:227)

Since the initial filing of this application, a potential human orthologue for murine
AA671275 appeared in the public database as the full-length ORF for vaccinia related
kinase 3 (BAA90769).

Human H05721 (SEQ ID NO:111, SEQ ID NO:230)

Genescan and Genewise analyses of genomic sequence revealed an extension to
encompass the full-length ORF for H05721.

Human AI086865 (SEQ ID NO:112, SEQ ID NO:231)

Genescan and Genewise analyses of genomic sequence revealed an extension to
encompass the full-length ORF for AI086865. The full-length ORF was reconstructed
from Celera 17000102901516, Incyte 243269.1 and public AL1377531.

Human AA836348 (SEQ ID NO:113, SEQ ID NO:232)

Genescan and Genewise analyses of genomic sequence revealed an extension to
encompass the full-length ORF for AA836348.

Human R86668 (SEQ ID NO:114, SEQ ID NO:233)

The full-length ORF for R86668 was isolated by screening a cDNA library using a
probe derived from R86668. Since the initial filing of this application, the R86668 sequence
appeared in the public database as the full-length gene mitogen-activated protein kinase
kinase kinase 6 (MAP3K6) (NM_00467).

Human 2R41-9-4 (SEQ ID NO:116, SEQ ID NO:235)

The full-length virtual ORF for 2R41-9-4 was generated using genomic sequence
to provide the N-terminus for the partial ORF predicted from clone 2R41-9-4.

Table 10. Sequences deleted from the provisional patent due to duplication with other
genes in the patent

Prov. SEQ ID NO: (na)    Prov. SEQ ID NO: (aa)
160                      196
213                      214
215                      216
122                      126
119                      123
148                      184
4                        20
7                        23
205                      206
14                       30
15                       31
35                       56
42                       63
J1 1                     72
44                       65
78                       92
79                       93
80                       94
157                      191



Results 

Table 1 documents the results from the analysis of the nucleic acid sequence data.
From left to right, the data presented are as follows. "Gene name" refers to the EST or
PCR fragment that defined the novel kinase. "Species" refers to the organism from which
the sequence was derived. "ID#" refers to the nucleic acid and amino acid sequence ID
number designation from this patent. "Kinase family" and "Kinase group" refer to the
protein kinase classification defined by sequence homology and based on previously
established phylogenetic analysis [Hardie, G. and Hanks, S. The Protein Kinase Book,
Academic Press (1995); Hunter, T. and Plowman, G. Trends in Biochemical Sciences
(1997) 22:18-22; and Plowman, G.D. et al. (1999) Proc. Natl. Acad. Sci. 96:13603-13610].
"ORF Start", "ORF End" and "ORF Length" refer to the open reading frame range and
length as calculated by standard nucleic acid translation programs such as MapDraw
(DNAStar). "DNA Repeats" refers to regions of low complexity sequence or repetitive
elements such as Alu, LINE, SINE, and LTR sequences. The chromosomal location (CHR
localization) for 37 of the 110 novel protein kinases is shown on Table 1 (NA, not
available). The methods for determining chromosomal position are outlined below, in
Example 2.

Table 2 documents the results from the analysis of the amino acid sequence data.
From left to right, the data presented are as follows. "Gene name" refers to the EST or
PCR fragment that defined the novel kinase. "Species" refers to the organism from which
the sequence was derived. "ID#" refers to the nucleic acid and amino acid sequence ID
number designation from this patent. "Kinase family" and "Kinase group" refer to the
protein kinase classification defined by sequence homology and based on previously
established phylogenetic analysis [Hardie, G. and Hanks, S. The Protein Kinase Book,
Academic Press (1995); Hunter, T. and Plowman, G. Trends in Biochemical Sciences
(1997) 22:18-22; and Plowman, G.D. et al. (1999) Proc. Natl. Acad. Sci. 96:13603-13610].
"nraa Score", "ID match aa", "Identity", "Similar", "nraa Match Acc#" and "Description"
refer to the data obtained using a Smith-Waterman search of the amino acid sequence
against the non-redundant protein database (matrix PAM100; gap open and extension
penalties 14 and 1). "Kinase Domain Start", "Kinase Domain End", "Profile Start" and
"Profile End" refer to data obtained using a Hidden Markov Model to define catalytic
range boundaries. The profile has a length of 261 amino acids, corresponding to the
complete protein kinase catalytic domain. Proteins in which the profile recognizes a
full-length catalytic domain have a "Profile Start" of 1 and a "Profile End" of 261. The
boundaries of the catalytic domain within the overall protein are noted in the "Kinase
Domain Start" and "Kinase Domain End" columns.
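
The "Profile Start" and "Profile End" rule for recognizing a full-length catalytic domain can be expressed as a small check. The sketch below is illustrative only: the function names are hypothetical, coordinates are assumed to be 1-based and inclusive, and the profile length of 261 is taken from the text above.

```python
PROFILE_LENGTH = 261  # length of the kinase catalytic-domain profile, per the text


def profile_coverage(profile_start, profile_end, profile_length=PROFILE_LENGTH):
    """Fraction of the profile matched by a hit (1-based, inclusive coordinates)."""
    if not 1 <= profile_start <= profile_end <= profile_length:
        raise ValueError("profile coordinates out of range")
    return (profile_end - profile_start + 1) / profile_length


def has_full_catalytic_domain(profile_start, profile_end):
    """True when the hit spans the whole profile (Profile Start 1, Profile End 261)."""
    return profile_start == 1 and profile_end == PROFILE_LENGTH
```

A partial sequence whose hit begins at, say, profile position 131 would report about half coverage, which is one way to flag entries whose catalytic domain is truncated at the N-terminus.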

The following abbreviations were used for kinases:

ASK      Apoptosis signal-regulating kinase
CaMK     Ca2+/calmodulin-dependent protein kinase
CCRK     Cell cycle-related kinase
CDK      Cyclin-dependent kinase
CK       Casein kinase
DAPK     Death-associated protein kinase
DM       Myotonic dystrophy kinase
Dyrk     Dual-specificity tyrosine phosphorylation-regulated kinase
GAK      Cyclin G-associated kinase
GRK      G-protein coupled receptor kinase
GuC      Guanylate cyclase
HIPK     Homeodomain-interacting protein kinase
IRAK     Interleukin-1 receptor-associated kinase
MAPK     Mitogen activated protein kinase
MAST     Microtubule-associated STK
MLCK     Myosin-light chain kinase
MLK      Mixed lineage kinase
NIMA     NimA-related protein kinase
PKA      cAMP-dependent protein kinase
RSK      Ribosomal protein S6 kinase
RTK      Receptor tyrosine kinase
SGK      Serum and glucocorticoid-regulated kinase
STK      Serine threonine kinase
ULK      UNC-51-like kinase

The following abbreviations were used for species:

H        Human
M        Murine
R        Rat
FV       Fowlpox virus
MT       M. thermoautotrophicum
CE       Caenorhabditis elegans
DM       Drosophila melanogaster
OS       Oryza sativa
SP       Schizosaccharomyces pombe
TP       Tetrahymena pyriformis
PI       Petunia inflata
NC       Neurospora crassa
MSV      Medicago sativa
MSV      Moloney murine sarcoma virus
SA       Squalus acanthias
CS       Cucumis sativus
GM       Glycine max
LL       Lilium longiflorum
TV       Trichomonas vaginalis
MP       Mycoplasma pneumoniae
DD       Dictyostelium discoideum
SC       Saccharomyces cerevisiae
MT       Methanobacterium thermoautotrophicum

Domain and Motif Identification

A Hidden Markov model (HMM) (Krogh, A., Brown, M., Mian, I. S., Sjolander,
K., and Haussler, D. (1994). Hidden Markov models in computational biology:
Applications to protein modeling. J. Mol. Biol., 235: 1501-1531) was used to identify
both catalytic and extra-catalytic domains. Table 4 shows extra-catalytic domains that
were identified using the HMM program. Other domains, such as coiled-coil and PEST
motifs, were identified as described next.

Potential coiled-coil domains were identified using the COILS program
(www.ch.embnet.org/software/COILS_form.html). The matrix used was MTIDK with
windows of 14, 21 and 28 amino acids. Only regions scoring 0.5 or higher were
considered to be potential coiled-coil regions.
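
The 0.5 cutoff can be applied to per-residue scores to recover contiguous candidate regions. The following is a minimal sketch, assuming the per-residue scores have already been parsed from the COILS output into a plain list (the parsing step, the function name `coiled_coil_regions` and the example scores are hypothetical and not part of the original analysis).

```python
def coiled_coil_regions(scores, threshold=0.5):
    """Return (start, end, length) tuples for runs of residues scoring >= threshold.

    Positions are 1-based and inclusive, matching amino acid numbering.
    """
    regions = []
    start = None
    for pos, score in enumerate(scores, start=1):
        if score >= threshold:
            if start is None:
                start = pos  # a new run begins here
        elif start is not None:
            regions.append((start, pos - 1, pos - start))
            start = None
    if start is not None:  # run extends to the last residue
        regions.append((start, len(scores), len(scores) - start + 1))
    return regions


# Hypothetical scores for a six-residue stretch
example = coiled_coil_regions([0.1, 0.6, 0.7, 0.2, 0.5, 0.9])
```

Each tuple corresponds to one row of a region table with an amino acid range and a length column.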

Protein sequences containing potential PEST motifs were identified using the
program PESTfind (www.at.embnet.org/embnet/tools/bio/PESTfind/). PEST regions in
proteins are by definition sequences that tend to be rich in proline, glutamic or aspartic
acid, serine and threonine; they have been associated with increased protein turnover
rates (Rogers S. et al. (1986) Science 234, 364-368). The algorithm defines PEST
sequences as hydrophilic stretches of amino acids greater than or equal to 12 residues in
length. Such regions contain at least one P, one E or D and one S or T. They are flanked
by lysine (K), arginine (R) or histidine (H) residues, but positively charged residues are
disallowed within the PEST sequence. PESTfind produces a score ranging from about -50
to +50. By definition, a score above zero denotes a possible PEST region; a value greater
than +5 defines a high probability that there is a PEST domain.
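
The composition rules and score cutoffs stated above can be sketched as two small checks. This is not the PESTfind algorithm: it does not compute the PESTfind score, and it ignores hydrophilicity and the flanking K/R/H residues. The function names and the "unlikely" label for scores of zero or below are assumptions added for illustration.

```python
def is_pest_candidate(region):
    """Check the composition rules quoted in the text for a candidate stretch.

    The stretch must be at least 12 residues long, contain at least one P,
    one E or D, and one S or T, and contain no K, R or H.
    """
    region = region.upper()
    return (
        len(region) >= 12
        and "P" in region
        and ("E" in region or "D" in region)
        and ("S" in region or "T" in region)
        and not any(aa in "KRH" for aa in region)
    )


def classify_pest_score(score):
    """Interpret a PESTfind-style score using the thresholds given in the text."""
    if score > 5:
        return "high probability"
    if score > 0:
        return "possible"
    return "unlikely"  # label assumed; the text only defines the two cases above
```

Under these cutoffs a score of +4.91 would be reported as a possible PEST region, while +31.08 would fall in the high-probability class.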

Identification of potential coiled-coil domains and PEST domains in N34132

Potential coiled-coil domains were identified in N34132 (SEQ ID NO:183) using
the COILS program. Only regions scoring 0.5 or higher were considered to be potential
coiled-coil regions. The amino acid positions within N34132 scoring for potential
coiled-coil regions are shown below.

Table 11. Coiled-coil domains predicted for N34132

Coiled-coil Region    Amino acid range    Length (aa)

Potential PEST domains were identified in N34132 using PESTfind; a value
greater than +5 defines a high probability that there is a PEST domain. The amino acid
positions within N34132 scoring for potential PEST regions are shown below.



Table 12. Potential PEST domains identified in N34132

PEST Region    Score     Amino acid range    Amino Acid Length
1              +4.91     54-95               42
2              +11.4     537-570             34
3              +31.08    1293-1304           12
4              +10.15    1543-1565           23
5              +6.17     1698-1732           35



EXAMPLE 2. Chromosomal Localization of Novel Mammalian Protein Kinases
Materials and Methods

Several sources were used to find information about the chromosomal localization
of each of the genes described in this patent. First, the accession number for the nucleic
acid sequence was used to query the Unigene database. The site containing the Unigene
search engine is: http://www.ncbi.nlm.nih.gov/UniGene/Hs.Home.html. Information on
map position within the Unigene database is imported from several sources, including the
Online Mendelian Inheritance in Man (OMIM,
http://www.ncbi.nlm.nih.gov/Omim/searchomim.html), The Genome Database
(http://gdb.infobiogen.fr/gdb/simpleSearch.html), and the Whitehead Institute human
physical map (http://carbon.wi.mit.edu:8000/cgi-bin/contig/sts_info?database=release).
For example, searching Unigene with W56561, an EST for a MAK-like kinase, retrieves
the following information: Chr. 14, D14S65-qTEL. The location of this gene on
an "ideogram" of the cytogenetic map of chromosome 14 is also provided, showing that
W56561 maps to the bottom of chromosome 14, between 14q31 and 14qTel. If Unigene
has not mapped the EST, then the nucleic acid for the gene of interest is used as a query
against databases, such as dbsts and htgs (described at
http://www.ncbi.nlm.nih.gov/BLAST/blast_databases.html), containing sequences that
have been mapped already. The nucleic acid sequence is searched using BLAST-2 at
NCBI (http://www.ncbi.nlm.nih.gov/cgi-bin/BLAST/nph-newblast) and is used to query
either dbsts or htgs. In addition to the Whitehead and GDB sites mentioned above,
Stanford University maintains a useful site for chromosomal mapping from STS data
(http://www-shgc.stanford.edu/RH/rhserverformnew.html). Matches in htgs are often
resolved immediately because the genomic region hit is annotated in the htgs entry. If an
exact match is found (defined roughly as 99% identity over a region of about 100
base pairs or longer, excluding any repetitive sequence), then the mapped position of the
entry in the database is assigned to the original kinase query. Once a cytogenetic region
has been identified by one of these approaches, disease association is established by
searching OMIM (see above for URL) with the cytogenetic location. OMIM maintains a
searchable catalog of cytogenetic map locations organized by disease. A thorough search
of available literature for the cytogenetic region is also made using Medline
(http://www.ncbi.nlm.nih.gov/PubMed/medline.html). References for association of the
mapped sites with chromosomal abnormalities found in human cancer can be found in:
Knuutila, et al., Am J Pathol, 1998, 152:1107-1123.
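
The "exact match" rule used to transfer a mapped position to a kinase query can be written as a simple filter over search hits. The sketch below assumes hits have already been reduced to percent identity, aligned length and map position; the dictionary keys, function names and example positions are hypothetical, and masking of repetitive sequence (which the text requires) is not handled here.

```python
def is_exact_match(percent_identity, aligned_length,
                   min_identity=99.0, min_length=100):
    """Apply the rough criterion from the text: >= 99% identity over >= 100 bp."""
    return percent_identity >= min_identity and aligned_length >= min_length


def assign_map_position(hits):
    """Return the map position of the first hit that qualifies, or None.

    Each hit is a dict with keys 'identity', 'length' and 'map_position'
    (an assumed, simplified representation of a dbsts or htgs hit).
    """
    for hit in hits:
        if is_exact_match(hit["identity"], hit["length"]):
            return hit["map_position"]
    return None


# Hypothetical hits for one query
hits = [
    {"identity": 97.5, "length": 400, "map_position": "1p36"},
    {"identity": 99.6, "length": 250, "map_position": "14q31-qter"},
]
position = assign_map_position(hits)
```

Because the text describes the thresholds as approximate, both cutoffs are exposed as parameters rather than fixed constants.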



Results

The chromosomal location for 37 of the 110 novel protein kinases is shown on
Table 1. Three of the novel protein kinases were mapped to regions associated with
cancer amplicons, as shown on this table. The regions were also cross-checked with the
Mendelian Inheritance in Man database, which tracks genetic information for many human
diseases, including cancer. References for association of the mapped sites with
chromosomal abnormalities found in human cancer can be found in: Knuutila, et al., Am J
Pathol, 1998, 152:1107-1123. Association of these mapped regions with other diseases is
documented in the Online Mendelian Inheritance in Man (OMIM)
(http://www.ncbi.nlm.nih.gov/htbin-post/Omim).

EXAMPLE 3: Generation of Specific Immunoreagents
Materials and Methods

Peptide sequences from extra-catalytic regions of the novel kinases are chosen that are
not homologous to other known kinases, based on a Smith-Waterman homology search
against the non-redundant protein database, and that are predicted to be antigenic based
on the DNAStar Protean program. These peptides are conjugated to KLH using
glutaraldehyde.

Rabbits are immunized with the KLH-peptide conjugates by four injections three
weeks apart. The rabbits are bled ten and fourteen days following the third injection and
bled out ten days after the fourth. The serum is checked against the peptide by ELISA.
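
The injection and bleed timing described above reduces to simple date arithmetic. The following sketch lays out the schedule from a chosen first-injection date; the function name, the returned dictionary layout and the example start date are hypothetical, and only the intervals (three weeks between injections, bleeds 10 and 14 days after the third injection, final bleed 10 days after the fourth) come from the text.

```python
from datetime import date, timedelta


def immunization_schedule(first_injection):
    """Return injection and bleed dates for the four-injection protocol."""
    injections = [first_injection + timedelta(weeks=3 * i) for i in range(4)]
    third, fourth = injections[2], injections[3]
    return {
        "injections": injections,
        "test_bleeds": [third + timedelta(days=10), third + timedelta(days=14)],
        "final_bleed": fourth + timedelta(days=10),
    }


# Hypothetical start date
schedule = immunization_schedule(date(2000, 1, 3))
```

Counting from the first injection as day 0, this places the injections on days 0, 21, 42 and 63, the test bleeds on days 52 and 56, and the final bleed on day 73.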



Table 13. Peptides to be used as immunogens for raising antibodies

Clone Name    SEQ ID NO (aa)    Peptide Sequence     Amino Acid Location
AA8256850     124               KSRDNSRDSSQSEND      339-353
                                TEKLKRSQDLPREPLP     372-386
                                RGWRPYDIHS           223-232
5R79-46-1     126               FEGPRRNKEVMYK        224-236
                                KDDYNETVHKKTE        451-463
                                GTHPKDRNVEKJLQ       541-553
                                EVSKYQEYTNELQET      643-657
AA256100      129               IDDTSNFDDFPESDI      405-419
                                TEPDYKSKDWVFL        427-439
                                EEKKLRRSQHARKET      61-75
AA210825      130               SNKDTLRKRHYWRLD      507-521
                                RHTTRKSSTTLRE        488-500
                                FQNNTTNRYYKEIPL      528-542
                                GKHRKTGRDVAVK        668-680
                                FPTKQESQLRNE         687-698
AA316804                        ESHYHQEPSKRIPS       239-252
                                HTKRKSSTMYKEGW       409-422
                                PSDLDVERDEEAVK       375-388
                                SPGQGKDHKDLSTSI      543-557
R47805        143               EPVGRWDQDYDRAVL      44-58
                                KPKGPGGKRGHKRLI      325-339
                                PTDVAQLPSRVPRDA      219-233
AA234451      167               DPFDWEKTGNDGSLT      293-307
                                HPRPQEKDVWEE         374-385
                                RENTDEVFPDEQLSD      340-354
                                RSEITQPDRDIPLVR      427-441
AA460132      180               LKSYSTSSKKARPVL      222-236
                                KKLDEVRLRGRKRSM      237-251
                                ETEKTAQGLSNLAKT      131-145
N34132        183               SGRRRRPTKSKGSKS      1848-1862
                                PGTAPSKPPLTKAPV      1474-1488
                                VDSDTQPKAPGIDD       1365-1378
                                AHSLDKTSHSSTTGL      1253-1267
5R69-17-2     187               GTTREKTDRVKST        178-190
                                HSEAPELHGKJRSSN      138-152
                                DETVTPPQFSrV         87-98*
                                QYDVKSEIYS           204-213
AA278842      206               TVDPEKSVRDQAFKA      515-529
                                DSSTADRWDDEDWGS      637-651
                                SVSEDPTQLEEVEKD      539-553
AA836348      232               NAPTKRPRSSTVTEA      323-337
                                LDSEEDYYTPQKVDV      514-528
                                GDKASYRQPKHVEICL     409-423

EXAMPLE 4. Expression analysis of Novel Mammalian Protein Kinases
GENE EXPRESSION ANALYSIS
Tissue Arrays

"cDNA libraries" derived from a variety of sources were immobilized onto nylon
membranes and probed with 32P-labeled cDNA fragments derived from the gene(s) of
interest.

Total RNA or mRNA was used as template in a reverse transcription reaction to
generate single-stranded cDNAs (ss cDNA) that were tagged with specific sequences at
each end. An oligo dT primer containing a specific sequence (CDS:
AAGCAGTGGTAACAACGCAGAGTACT30VN (V=A,G,C; N=A,G,C,T)) anneals at
the polyA tract at the 3' end of the mRNA, and the reverse transcriptase (MMLV
RNaseH-) transcribes the antisense strand until it reaches the end of the RNA strand,
where it adds additional C residues. If a primer (SMII:
AAGCAGTGGTAACAACGCAGAGTACGCGGG or ML2G:
AAGTGGCAACAGAGATAACGCGTACGCGGG) ending with 3 Gs is added, it anneals
to the added Cs, and the MMLV enzyme recognizes the rest of the primer sequence as
template and continues transcription. As a result, the synthesized cDNAs contain specific
sequence tags at both the 5' and the 3' end. When the 5' and the 3' ends are tagged with
the same sequence (CDS and SMII), the cDNA is referred to as "symmetric." When the 5'
end is tagged with a different sequence than the 3' end (CDS and ML2G), it is referred to
as "asymmetric."
A double-stranded "cDNA library" is then generated by PCR amplification using the
3' PCR and ML2 primers (3' PCR: AAGCAGTGGTAACAACGCAGAGT and ML2:
AAGTGGCAACAGAGATAACGCGT) that anneal to the added sequence tags.
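
The relationship between the tagging oligonucleotides and the PCR primers can be checked directly from the sequences quoted above: each PCR primer is a prefix of the tag it amplifies. The sketch below is illustrative only. It compares sequences in the same orientation rather than modeling strand complementarity, the dictionary and function names are hypothetical, and the CDS entry writes out the T30 stretch while keeping the degenerate VN as literal characters.

```python
# Sequences as given in the text (5' to 3')
TAGS = {
    "CDS": "AAGCAGTGGTAACAACGCAGAGTAC" + "T" * 30 + "VN",
    "SMII": "AAGCAGTGGTAACAACGCAGAGTACGCGGG",
    "ML2G": "AAGTGGCAACAGAGATAACGCGTACGCGGG",
}
PRIMERS = {
    "3' PCR": "AAGCAGTGGTAACAACGCAGAGT",
    "ML2": "AAGTGGCAACAGAGATAACGCGT",
}


def matching_primer(tag_sequence):
    """Return the name of the PCR primer that is a prefix of the tag, or None."""
    for name, primer in PRIMERS.items():
        if tag_sequence.startswith(primer):
            return name
    return None


def library_type(five_prime_tag, three_prime_tag="CDS"):
    """'symmetric' if both ends share a primer site, otherwise 'asymmetric'."""
    same = (matching_primer(TAGS[five_prime_tag])
            == matching_primer(TAGS[three_prime_tag]))
    return "symmetric" if same else "asymmetric"
```

With these definitions, a cDNA tagged with CDS and SMII is classified as symmetric, and one tagged with CDS and ML2G as asymmetric, in agreement with the definitions in the text.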

The amplified "cDNA libraries" were manually arrayed onto nylon membranes
with a 384 pin replicator. The DNA was denatured by alkali treatment, neutralized and
cross-linked by UV light. The arrays were pre-hybridized with Express Hyb (Clontech)
and hybridized with 32P-labeled probes generated by random hexamer priming of cDNA
fragments corresponding to the genes of interest. After washing, the blots were exposed to
phosphorimaging cassettes and the intensity of the signal was quantified. The amount of
the DNA on the arrays was also quantified by treating non-denatured or denatured arrays
with SYBR Green I or SYBR Green II, respectively (1:100,000 in 50 mM Tris, pH 8.0), for 2
minutes. After washing with 50 mM Tris, pH 8.0, the fluorescent emission was detected
with a phosphorimager (Molecular Dynamics) and quantified. The amount of the arrayed
DNA was used to normalize the hybridization signal, and the corrected values are tabulated
in Table 3.
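
The normalization step can be illustrated with a short sketch. The text states only that the amount of arrayed DNA was used to normalize the hybridization signal; dividing the signal by the DNA amount spot by spot is an assumption made here, as are the function name, the handling of empty spots and the example values.

```python
def normalize_signals(hyb_signal, dna_amount):
    """Divide each hybridization signal by the DNA amount at the same spot.

    Spots with no measurable DNA (amount <= 0) yield None instead of a value.
    """
    if len(hyb_signal) != len(dna_amount):
        raise ValueError("signal and DNA lists must have the same length")
    return [s / d if d > 0 else None for s, d in zip(hyb_signal, dna_amount)]


# Hypothetical readings for three array spots
corrected = normalize_signals([100.0, 50.0, 10.0], [2.0, 0.5, 0.0])
```

Normalizing in this way keeps a spot carrying more arrayed DNA from appearing to express the gene more strongly than a spot carrying less.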

Results 

The results of the microarray expression analysis of the protein kinases presented
in this application are shown in Table 3. Data presentation from left to right is as follows:
"Tissue": tissue type of the cDNA; "Tumor sym" indicates that the tissue is derived from
a tumor, where "sym" refers to the fact that the 5' and 3' primers used to make the sample
are the same; "Normal sym" indicates that normal tissue was used to make the sample, with
symmetric primers as described above; "Tumor lo" indicates that primary tumor tissue
was used to make the cDNA; "Tumor cells" indicates that these cDNA samples were
made from cultured tumor cells; "Normal" indicates that these samples are derived from
normal tissue or cell lines; "Endos" indicates that these samples are derived from
endothelium-related tissue sources; "p53" refers to the status, mutant or wild-type, of the
p53 gene in the source samples. Normalized expression values are presented for each
gene, referred to by its SEQ ID#, in the subsequent columns. Genes represented in
expression Table 3 are: SEQ ID NO:3 (AA826850), SEQ ID NO:5 (TBK1), SEQ ID NO:6
(AA305176), SEQ ID NO:8 (AA256100), SEQ ID NO:9 (CAB43292), SEQ ID NO:11
(EPK2), SEQ ID NO:12 (PKNbeta), SEQ ID NO:14 (H19102), SEQ ID NO:16 (RSK4),
SEQ ID NO:17 (AAD30182), SEQ ID NO:20 (SGK2), SEQ ID NO:22 (PTK9L), SEQ ID
NO:26 (AA383293), SEQ ID NO:29 (DRAK2), SEQ ID NO:31 (DRAK1), SEQ ID
NO:32 (AA015726), SEQ ID NO:40 (MAK-V), SEQ ID NO:44 (TRAD), SEQ ID
NO:44 (TRAD), SEQ ID NO:45 (AA454060), SEQ ID NO:47 (AA234451), SEQ ID
NO:48 (AA436054), SEQ ID NO:49 (AA626859), SEQ ID NO:51 (KIAA0904), SEQ ID
NO:52 (AA789239), SEQ ID NO:54 (CCRK), SEQ ID NO:55 (CLK4), SEQ ID NO:56
(AA557536), SEQ ID NO:57 (W56561), SEQ ID NO:60 (AA579641), SEQ ID NO:63
(NEK7), SEQ ID NO:66 (CAMKKB), SEQ ID NO:68 (HIPK2), SEQ ID NO:72
(R19609), SEQ ID NO:73 (HRI), SEQ ID NO:78 (AA088547), SEQ ID NO:79
(AA449542), SEQ ID NO:82a (MLK4), SEQ ID NO:82 (MLK4b), SEQ ID NO:84
(RIP4), SEQ ID NO:88 (AA278842), SEQ ID NO:89 (AA195964), SEQ ID NO:90
(MSSK1), SEQ ID NO:93 (TSK4), SEQ ID NO:94 (AI025291), SEQ ID NO:95
(AA948538), SEQ ID NO:96 (AA905446), SEQ ID NO:97 (H85389), SEQ ID NO:100
(AA018361), SEQ ID NO:101 (AA311714), SEQ ID NO:110 (AA452647), SEQ ID
NO:111 (AA310219), SEQ ID NO:112 (AI086865), SEQ ID NO:114 (MEKK6), and SEQ
ID NO:116 (SuRTK106).

EXAMPLE 5. Kinase assays for Erk, JNK1 and p38 MAP kinases

293T cells were transiently transfected with HA-p38 or co-transfected with
Flag-tagged wt MLK4A, kinase-dead MLK4A, wild-type MLK4B or kinase-dead MLK4B
using Lipofectamine 2000 (Lifetech). Cells were lysed 36 hr post-transfection. Cell
lysates normalized to contain equivalent amounts of HA-p38 were immunoprecipitated
with anti-HA antibody (Mab HA-11, Babco). Immunoprecipitates were split into two
portions; one portion was Western-blotted with anti-HA antibody and the other with a
phospho-specific p38 antibody (Promega) to detect activated levels of p38. Activation of
Erk1 and Jnk1 was measured similarly. (This example applies to AA232253 (SEQ ID
NO:82, SEQ ID NO:201).)

Results:

In transient assays, wild-type MLK4A and MLK4B (but not kinase-inactive
MLK4A(K45M) or MLK4B(K45M)) activate Erk, JNK1 and p38 MAP kinases.

EXAMPLE 6. RAC1 guanine-exchange factor assay

293T cells were transiently transfected with HA-Rac1 or co-transfected with
Flag-tagged Duet C, Duet E, Dbl and HA-Tiam-1. Cells were lysed 36 hours post-transfection.
Cell lysates normalized to contain equivalent amounts of Rac1 were affinity precipitated
with immobilized GST-PBD (p21-binding domain of Pak3). Bound proteins were
Western blotted and probed with anti-HA antibody to detect levels of activated Rac1.
(This example applies to R19772 (Trad/Duet) (SEQ ID NO:44, SEQ ID NO:164).)

Results:

Duet C and Duet E both act as guanine nucleotide exchange factors on Rac1.

CONCLUSION

One skilled in the art would readily appreciate that the present invention is well 
adapted to carry out the objects and obtain the ends and advantages mentioned, as well as 
those inherent therein. The molecular complexes and the methods, procedures, treatments,
molecules, and specific compounds described herein are presently representative of preferred
embodiments, are exemplary, and are not intended as limitations on the scope of the
invention. Changes therein and other uses will occur to those skilled in the art which are
encompassed within the spirit of the invention and are defined by the scope of the claims.

It will be readily apparent to one skilled in the art that varying substitutions and
modifications may be made to the invention disclosed herein without departing from the 
scope and spirit of the invention. 

All patents and publications mentioned in the specification are indicative of the 
levels of those skilled in the art to which the invention pertains. 

The invention illustratively described herein suitably may be practiced in the 
absence of any element or elements, limitation or limitations which is not specifically 
disclosed herein. Thus, for example, in each instance herein any of the terms
"comprising", "consisting essentially of" and "consisting of" may be replaced with either
of the other two terms. The terms and expressions which have been employed are used as
terms of description and not of limitation, and there is no intention in the use of such
terms and expressions of excluding any equivalents of the features shown and described or
portions thereof, but it is recognized that various modifications are possible within the 
scope of the invention claimed. 

In particular, although some formulations described herein have been identified by 
the excipients added to the formulations, the invention is meant to also cover the final 
formulation formed by the combination of these excipients. Specifically, the invention 
includes formulations in which one to all of the added excipients undergo a reaction 
during formulation and are no longer present in the final formulation, or are present in 
modified forms. 

In addition, where features or aspects of the invention are described in terms of 
Markush groups, those skilled in the art will recognize that the invention is also thereby 
described in terms of any individual member or subgroup of members of the Markush
group. For example, if X is described as selected from the group consisting of bromine,
chlorine, and iodine, claims for X being bromine and claims for X being bromine and
chlorine are fully described.

Other embodiments are within the following claims.

What is claimed is:

CLAIMS 

1. An isolated, enriched, or purified nucleic acid molecule encoding a kinase
polypeptide selected from the group consisting of SEQ ID NO:122, SEQ ID NO:123, SEQ
ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ
ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ
ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ
ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ
ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ
ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ
ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ
ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ
ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ
ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ
ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ
ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ
ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ
ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ
ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ
ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ
ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ
ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ
ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ
ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ
ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ
ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ
ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ
ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242.

2. The nucleic acid molecule of claim 1, wherein said nucleic acid molecule
comprises a nucleotide sequence that:

(a) encodes a polypeptide comprising the amino acid sequence set forth in
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242;

(b) is the complement of the nucleotide sequence of (a);

(c) hybridizes under highly stringent conditions to the nucleotide
molecule of (a) and encodes a naturally occurring kinase polypeptide;



(d) encodes a kinase polypeptide having an amino acid sequence
selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, except that it lacks one or more,
but not all, of a domain selected from the group consisting of an N-terminal domain, a 
catalytic domain, a C-terminal domain, a coiled-coil structure region, a proline-rich region, 
a spacer region, an insert, and a C-terminal tail; 

(e) is the complement of the nucleotide sequence of (d);

(f) encodes a domain of an amino acid sequence selected from the
group set forth in
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, wherein said domain is selected from the group
consisting of an N-terminal domain, a catalytic domain, a C-terminal domain, a coiled-coil 
structure region, a proline-rich region, a spacer region, an insert, and a C-terminal tail; 

(g) is the complement of the nucleotide sequence of (f); 

(h) encodes a polypeptide comprising an amino acid sequence selected
from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, except that it lacks one or more, but not
all, of the domains selected from the group consisting of an N-terminal domain, a catalytic
domain, a C-terminal domain, a spacer region, a proline-rich region, a coiled-coil structure
region, and a C-terminal tail; or

(i) is the complement of the nucleotide sequence of (h).

3. The nucleic acid molecule of claim 1, further comprising a vector or
promoter effective to initiate transcription in a host cell.

4. The nucleic acid molecule of claim 1, wherein said nucleic acid molecule is
isolated, enriched, or purified from a mammal.

5. The nucleic acid molecule of claim 4, wherein said mammal is a human.

6. A nucleic acid probe for the detection of nucleic acid encoding a kinase 
polypeptide in a sample, wherein said polypeptide is selected from the group consisting of 
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242.

7. The probe of claim 6, wherein said polypeptide is a fragment of the protein
encoded by an amino acid sequence selected from the group consisting of SEQ ID
NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID
NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID
NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID
NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID
NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID
NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID
NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID
NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID
NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID
NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID
NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID
NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID
NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID
NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID
NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID
NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID
NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID
NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID
NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID
NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID
NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID
NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ
ID NO:242.

8. A recombinant cell comprising a nucleic acid molecule encoding a kinase
polypeptide selected from the group consisting of SEQ ID NO:122, SEQ ID NO:123, SEQ
ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ
ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ
ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ
ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ
ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ
ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ
ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ
ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ
ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ
ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ
ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ
ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ
ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ
ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ
ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ
ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ
ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ
ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ
ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ
ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ
ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ
ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ
ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ
ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242.

9. The cell of claim 8, wherein said polypeptide is a fragment of a protein
encoded by an amino acid sequence selected from the group consisting of SEQ ID
NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID
NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID
NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID
NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID
NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID
NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID
NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID
NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID
NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID
NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID
NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID
NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID
NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID
NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID
NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID
NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID
NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID
NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID
NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID
NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID
NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID
NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ
ID NO:242.

10. An isolated, enriched, or purified kinase polypeptide selected from the
group consisting of SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125,
SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130,
SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135,
SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140,
SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145,
SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150,
SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155,
SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160,
SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165,
SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170,
SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175,
SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180,
SEQ ID NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185,
SEQ ID NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190,
SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195,
SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200,
SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205,
SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210,
SEQ ID NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215,
SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220,
SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225,
SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230,
SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235,
SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240,
SEQ ID NO:241, and SEQ ID NO:242.

11. The polypeptide of claim 10, wherein said polypeptide is a fragment of the
protein encoded by an amino acid sequence selected from the group consisting of SEQ ID
NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID
NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID
NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID
NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID
NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID
NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID
NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID
NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID
NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID
NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID
NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID
NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID
NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID
NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID
NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID
NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID
NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID
NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID
NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID
NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID
NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID
NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ
ID NO:242.

12. The polypeptide of claim 10, wherein said polypeptide comprises:

(a) an amino acid sequence selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242;

(b) an amino acid sequence selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, except that it lacks one or more, but not all, of the domains selected
from the group consisting of an N-terminal domain, a catalytic domain, a C-terminal domain,
a spacer region, a proline-rich region, a coiled-coil structure region, and a C-terminal tail;

(c) a domain of an amino acid sequence selected from the group set
forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID
NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID
NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID
NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID
NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID
NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID
NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID
NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID
NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID
NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID
NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID
NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID
NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID
NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID
NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID
NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID
NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID
NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID
NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID
NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID
NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID
NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID
NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID
NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID
NO:241, and SEQ ID NO:242, wherein said domain is selected from the group consisting
of a C-terminal domain, a catalytic domain, an N-terminal domain, a spacer region, a
proline-rich region, a coiled-coil structure region, and a C-terminal tail.

13. The kinase polypeptide of claim 10, wherein said polypeptide is isolated, 
purified, or enriched from a mammal. 

14. The kinase polypeptide of claim 13, wherein said mammal is a human. 

15. The kinase polypeptide of claim 10, wherein said polypeptide is an
AA144574, AA116841, AA256100, AA305176, AA210825, AA316804, AA980090,
N42050, AA476563, AA626690, AA960957, H19102, AA045601, AA107515,
AA109508 or AA887783 polypeptide.

16. The kinase polypeptide of claim 10, wherein said polypeptide is an H60215,
AA197883, AA297313, W30246, AA172300, AA383293, AA542015, H01248, N23936,
W44160, 2R22-5-11, 5R72-18-1, AA021445, AA207220, AA426580, AA544838,
W90839, 5R79-54-1, AA839940, R19772 or 5R72-8-2 polypeptide.

17. The kinase polypeptide of claim 10, wherein said polypeptide is an
AA234451 polypeptide.

18. The kinase polypeptide of claim 10, wherein said polypeptide is a 5R65-16-1,
AA061797, AA065538, AA124976, AA397553, AA435956, AA575635, AA626859,
AA789239, AI086865, H17727, H29974, AA557536 or N28606 polypeptide.

19. The kinase polypeptide of claim 10, wherein said polypeptide is an
AA631990 or W08549 polypeptide.

20. The kinase polypeptide of claim 10, wherein said polypeptide is a
5R72-16-2, R19927 or R43524 polypeptide.

21. The kinase polypeptide of claim 10, wherein said polypeptide is a 5R57- 1 o
2 polypeptide.

22. The kinase polypeptide of claim 10, wherein said polypeptide is an
AA232253 polypeptide.

23. The kinase polypeptide of claim 10, wherein said polypeptide is an
AA430250, AA836348, R86668 or N34132 polypeptide.

24. The kinase polypeptide of claim 10, wherein said polypeptide is an
AA098024 or SuRTK106 polypeptide.

25. The kinase polypeptide of claim 10, wherein said polypeptide is an R47805,
AA099102, AA589241, H85811, AA013524, AA452647, AA840598, AA088547,
AA139478, AA826850, R87679, W65887, H97685, W20810, AA599286, AA425725,
AA103218, AA711829, AA060026, AA399669, AA758539, AA883975, AA948538,
AA018361, AA215311, AA311714, AA498104, 5R69-17-2, 5R69-23-3, 5R69-26-2,
AA118352, AA396601, AA671275, AA278842, AA460132 or H05721 polypeptide.

26. An antibody or antibody fragment having specific binding affinity to a
kinase polypeptide selected from the group consisting of SEQ ID NO:122, SEQ ID
NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID
NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID
NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID
NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID
NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID
NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID
NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID
NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID
NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID
NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID
NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID
NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID
NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID
NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID
NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID
NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID
NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID
NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID
NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID
NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID
NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID
NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID
NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID
NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242.

27. The antibody or antibody fragment of claim 26, wherein said polypeptide
comprises:

(a) an amino acid sequence selected from the group consisting of SEQ
ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ
ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ
ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ
ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ
ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ
ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ
ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ
ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ
ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ
ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ
ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ
ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ
ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ
ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ
ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ
ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ
ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ
ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ
ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ
ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ
ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ
ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ
ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ
ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and
SEQ ID NO:242;

(b) an amino acid sequence selected from the group consisting of SEQ
ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ
ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ
ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ
ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ
ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ
ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ
ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ
ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ
ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ
ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ
ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ
ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ
ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ
ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ
ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ
ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ
ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ
ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ
ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ
ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ
ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ
ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ
ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ
ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and
SEQ ID NO:242, except that it lacks one or more, but not all, of the domains selected from
the group consisting of a C-terminal domain, a catalytic domain, an N-terminal domain, a
spacer region, a proline-rich region, a coiled-coil structure region, and a C-terminal tail;

(c) a domain of an amino acid sequence selected from the group set
forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID
NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID
NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID
NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID
NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID
NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID
NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID
NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID
NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID
NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID
NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID
NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID
NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID
NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID
NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID
NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID
NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID
NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID
NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID
NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID
NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID
NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID
NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID
NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID
NO:241, and SEQ ID NO:242, wherein said domain is selected from the group consisting
of a C-terminal domain, a catalytic domain, an N-terminal domain, a spacer region, a
proline-rich region, a coiled-coil structure region, and a C-terminal tail.

28. A hybridoma which produces an antibody having specific binding affinity
to a kinase polypeptide selected from the group consisting of SEQ ID NO:122, SEQ ID
NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID
NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID
NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID
NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID
NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID
NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID
NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID
NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID
NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID
NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID
NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID
NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID
NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID
NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID
NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID
NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID
NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID
NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID
NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID
NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID
NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID
NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID
NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID
NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242.

29. A method for identifying a substance that modulates kinase activity 
comprising: 

(a) contacting a kinase polypeptide selected from the group consisting
of SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,




SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242 with a test substance;

(b) measuring the activity of said polypeptide; and 

(c) determining whether said substance modulates the activity of said
polypeptide.

30. A method for identifying a substance that modulates kinase activity in a 
cell comprising: 

(a) expressing a kinase polypeptide in a cell, wherein said polypeptide 
is selected from the group consisting of SEQ ID NO:122, SEQ ID NO:123, SEQ ID
NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID
NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID
NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID
NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID
NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID
NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID
NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID
NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID
NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID
NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID
NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID
NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ ID
NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID
NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ ID
NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID
NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID
NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID
NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ ID
NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID
NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID
NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID
NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID
NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID
NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242;

(b) adding a test substance to said cell; and 

(c) monitoring a change in cell phenotype or the interaction between
said polypeptide and a natural binding partner.

31. A method for treating a disease or disorder by administering to a patient in
need of such treatment a substance that modulates the activity of a kinase selected from
the group consisting of SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID
NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID
NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID
NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID
NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID
NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID
NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID
NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID
NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID
NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID
NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID
NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID
NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID
NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID
NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID
NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID
NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID
NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID
NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID
NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID
NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID
NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID
NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID
NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID
NO:240, SEQ ID NO:241, and SEQ ID NO:242.

32. The method of claim 31, wherein said disease or disorder is selected from
the group consisting of immune-related diseases and disorders, cardiovascular disease, 
neurodegenerative disorders, and cancer. 

33. The method of claim 31, wherein said substance modulates kinase activity
in vitro.

34. The method of claim 33, wherein said substance is a kinase inhibitor.

35. A method for detection of a kinase polypeptide in a sample as a diagnostic 
tool for a disease or disorder, wherein said method comprises: 

(a) contacting said sample with a nucleic acid probe which hybridizes
under hybridization assay conditions to a nucleic acid target region of a kinase polypeptide

(b) detecting the presence or amount of the probe:target region hybrid
as an indication of said disease.

36. The method of claim 35, wherein said disease or disorder is selected from
the group consisting of immune-related diseases and disorders, cardiovascular disease,
neurodegenerative disorders, and cancer.

37. A method for detection of a kinase polypeptide in a sample as a diagnostic
tool for a disease or disorder, wherein said method comprises: 

(a) comparing a nucleic acid target region encoding said kinase 
polypeptide in a sample, wherein said kinase polypeptide is selected from the group
consisting of SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ
ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ
ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ
ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ
ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ
ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ
ID NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ
ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ
ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ
ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ
ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ
ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ
ID NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ
ID NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ
ID NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ
ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ
ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ
ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ
ID NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ
ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ
ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ
ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ
ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ
ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ
ID NO:241, and SEQ ID NO:242, or one or more fragments thereof, with a control nucleic
acid target region encoding said kinase polypeptide, or one or more fragments thereof; and 

(b) detecting differences in sequence or amount between said target 
region and said control target region, as an indication of said disease or disorder. 

38. The method of claim 37, wherein said disease or disorder is selected from 
the group consisting of immune-related diseases and disorders, cardiovascular disease, 
neurodegenerative disorders, and cancer.

FIGURE 1A



SEQ ID NO: 122_X69117_H BARK2_H

MADLEAVLADVSYLMAMEKSKATPAARASKRIVLPEPSIRSVMQKYLAERNEITFDKIFN
QKIGFLLFKDFCLNEINEAVPQVKFYEEIKEYEKLDNEEDRLCRSRQIYDAYIM?:ELLSC

SHPFSKQAVEHVQSHLSKKQVTSTLFQPYIEEICESLRGDIFQKFMESDKFTRFCQWKNV 
ELNIHLTMNEFSVHRI IGRGGFGEVYGCRKADTGKMYAMKCLDKKRIKMKQGETLoALJSIER 

TMLSLVSTGDCPFIVCMTYAFHTPDKLCFILDLMNGGDLHYHLSQHGVFSEKEMRFYATE 
I ILGLEHVHNRFVVTRDLKPANILLDEHGHARI SDLGLACDFSKLKKPHASVGTHGYMAPE 

VLQKGTAYDSSADWFSLGCMLFKLLRGHSPFRQHKTKDKHEIDRMTLTVNVELPDTFSPE 
LKSLLEGLLQRDVSKRLGCHGGGSQEVKEHSFFKGVDWQHVYLQKYPPPLTPPRGEVNAA 
DAFD I GS FDEEDTKGI KLLDCDQELYKNFPLVI SERWQQEVTETV^YEAVNADTDKI EARK 
RAKNKQLGHEEDYALGKDCIMHGYMLKLGNPFLTQWQRRYFYLFPNRLEWRGEGESRQNL 
LTMEQILSVEETQIKDKKCILFRIKGGKQFVLQGESDPEFVQWKKELNETFKEAQRLLRR 
APKFLNKPRSGTVELPKPSLCHRNSNGL 

SEQ ID NO: 123_AA144574_M BARK2_M 

CFWYRDLKPANILLDEYGHVRISDLGLACDFSKKKPHASVGTHGYMAPEVLQKGTCYDS 
SADWFSLGCMLFKLLRGHSPFRQHKTKDKHEIDRMTLTVNVQLPDAFSPELRSLLEGLLQ 
RDVSQRLGCGGGGARELKEH I FFKG I DWQHVYLRKYPP PL I P PRGE VNAADAFD I GS FDE 

EDTKGIKLLDCDQDLYKNFPLVISERWQQEVVETI YDAVNADTDKIEARKKAKNKQLGQE 

EDYAMGKDCIMHGYMLKLGNPFLTQWQRRYFYLFPNRLEWRGEGESRQSLLTMEQIMSVE 

ETQIKDRKCILLRIKGGKQFVLQCESDPEFAQWLKELTCTFNEAQRLLRRAPKFLNKPRA 
AILEFSKPPLCHRNSSGL 

SEQ ID NO: 124_AA826850_H 

MGSSMSAATARRPVFDDKEDVNFDHFQILRAIGKGSFGKVCIVQKRDT^ 

QCIERIDEVRNVFRELEILQEIEHVFLWLWYSFQDEEDMFMVVDLLLGGDLRY 

FSEDTVRLYICEMALiAIiDYLRGQHI IHRDVXPDNILLDERGHAliLTDFNIATI IKDGERA 

TAIJVGTKPYMAPEIFXSFWGGTGYSFEVDWWSVGVMAYELLRGWRPYDIHSSNAVESLV 

QLFSTVSVQYVPTWSK^^ALLRKLLTWPEHRLSSLQDVQAAPALAGVLWDHLSEKRVE 

PGFVPNKGRLHCDPTFEbEEMILESRPLHKKKKRLAKNKSRDNSRDSSQSENDYLQDCLD 

AIQQDFVIFNREKLKRSQDLPREPLPAPESRDAAEPVEDEAERSALPMCGPICPSAGSG 

SEQ ID NO: 125_AA960957_K 

MGGNHSHKPPVFDENEEVNFDHFQILRAIGKGSFGKVCIVQKRDTKKKYAMKYMNKQKCI 
ERDEVTINVFRELQIMQGLEHPFLVNLWYSFQDEEDMFMVVDLLLG^ 

GWKLYI CELALALEYLQRYHI I HRDI KPDNI LLDEHGHVH I TDFNI ATWKGAERASSM 
AGTKPYMAPEVFQVYMDRGPGYS YPVDWWSLGI TAYELLRGWRP YE I HS VTP I DE I LNMF 
KVERVHYSSTWCKGMVALLRKLLTKDPESRVSSLHDIQSVPYLiADMNWDAVFK 
VPNKGRLNCDPTFELEEMILESKPLHKKKKRLAKNRSRDGTKDSCPLNGHLQHCLETVRE 
EFIIFNREKLRRQQGQGSQLLDTDSRGGGQAQSKLQDGCNNNLLTHTCTRGCSS 

SEQ ID NO: 126_TBK1_H

MQSTSNHLWLLSDILGQGATANVFRGRHKKTGDLFAIK^/FNNISFLRPVDVQMREFEVLK 
KI^KNIVKLFAIEEETTTRHKVLIMEFCPCGSLYTVLEEPSNAYGLPESEFLIVLRDW 
GGMNHLRENGIVHRDIKPGNIMRVIGEDGQSVYKLTDFGAARELEDDEQFVSLYGTEEYL 
HPDMYERAVLRKDHQKKYGATVDLWSIGVTFYHAATGSLPFRPFEGPRRNKEVM 
KPSGAISGVQKAENGPIDWSGDMPVSCSLSRGLQVLLTPVLANILEADQEKCWGFDQFFA 
ETSDILHRMVIHVFSLQQMTAHKIYIKSYNTATIFHELVYKQTKI I SSNQELI YEGRRLV 
LEPGRLAQHFPKTTEENPIFWSREPLNTIGLI YEKI SLPKVHPRYDLDGDASMAKAITG 
VVCYACRI ASTLLLYQELMRKGI RWLI ELI KDDYNETVHKKTEVVITLDFCI RNI EKTVK

FIGURE 1B

VYEKLMKINLEAAELGE1 SDI HTKLLRLSSSQGTI ETSLQDIDSRLSPGGSLADAWAKQE 
GTHPKDRNVEKLCVLLNCKTEI YYQr KKDKAERRI^AYNEEQ IKKFDKQKXY YHATKAMTH 

FTDECVKKYEAFLNKSEEWIRKMLHLRKOLLSLTNQCFDIEEEVSKYQEYTNELQETLFC 
KMFTASSGIKKTMTPI YPSSNTLVEMTLGMKKLKEEMEGWKELAENNHI LERFGSLTMD 
GGLRNVDCL 

SEQ ID NO: 127_AA305176_H

MDPTAGSKKEPGGGAATEEGVNRI AVPKPPS I EEFS I VKPI SRGAFGKVYLGQKGGPCLYA 
VKVVKKADM I NKNMTHQ VQA£RDA1ALS KSPF I VHLYYSLQSAM^TVTLVMEYL I GGDVKS 
LLH I YGYFDEEMAVKY I S E VALAL.DYLHRHG 1 1 HRDLKPDNML I SNEGH I KLTD FGLS KV 
T LNRD I NMMD I LTT P S 

ETSQLSQGLVCPMSVDQKDTTPYSSKLLKSCLETVASNPGMPVKCLTSNLLQSRKRLATS 
SASSQSHTFISSVESECHSSPKWEKDCQV 

SEQ ID NO: 128_AA116841_M

TRPIPWPEGEEKLSDNAQSAMDMLLTIDDSKRAGMRELKQHPLFSEVDWENLQHQTMPFV 
PQPDDETDTSYFEARNNAQHLTVSGFSL 

SEQ ID NO: 129_AA256100_H 

MAMTAGTTTTFPMSNKTRERVTVAKljTLENFYSNLILQHEERETRQKKLEVAMEEEGIx^ 
EEKKXRRSQHARKETEFLRLKRTRLGLDDFESLK^IGRGAFGEVRLVQKKDTGHIYAMKI 
LRKSDMLEKEQVAKIRAERDILVEADGAWWKMFYSFQDKRNLYLIMEFLPGGDMMTLLM 
KKDTLTEEETQFYI SETVLAIDAIHQLGFIHRDI KPDNLLLDAKGHVKLSDFGLCTGLKK 
AHRTEFYRNLTHNPPSDFSFQNl^SKRKAETWKKNRRQLAYSWGTPDYIAPEVFMQTGY 
NKLCDWWSLGVIMYEMLIGYPPFCSETPQETYRKVMNWKE 

FCIDSENRIGNSGVEEI KGHPFFEGVDWEHIRERPAAI PIEIKSIDDTSNFDDFPESDIL 
QPVPNTTEPDYKSKDWVFLNYTYKRFEGLTQRGSIPTYMKAGKL 

SEQ ID NO: 130_AA210825_H

DSLLPTPALGTPLPIPWPVGSLRTPLSLESTRSPTQRLLPSTPKDPAILRSPPPARSFLG 
SPLSHHLLTRSRGSRTQGPPGPPGGSRVGSRRAVPGLPPWPPPPHYPAGLPGS PGPGSPP 
PPGGLELQSPPPLLPQI PAPGSGVSFHIQIGLTREFVLLPAASELAHVKQLACS I VDQKF 
PECGFYGLYDKILLFKHDPTSANLLQLVRSSGDIQEGDLVEWLSASATFEDFQIRPHAL 
TVHS YRAPAFCDHCGEMLFGLVRQGLKCDGCGLNYHKRCAFS I PNNCSGARKRRLSSTSL 
ASGHSVRLGTSESLPCTAEELSRSTTELLPRRPPSSSSSSSASSYTGRPIELDKMLLSKV 
KVPHTFLIHSYTRPTVCQACKJOjLKGLFRQGL^ 

GDVPMEEATDFSEADKSALMDESEDSGVI PGSHSENALHASEEEEGEGGKAQSSLGYI PL 
MRWQSWHTTRKSSTTLREGWWHYSNKDTLRKR^ 

PLSEILTVESAQNFSLVPPGTNPHCFEIVTANATYFVGEMPGGTPGGPSGQGAEAARGLX 
ETAIRQALMPVILQDAPSAPGHAPHRQASLSISVSNSQIQENVDIATVYQI FPDEVLGSG 
QFGVVYGGKI4RKTGRDVAVKVIDKLRFPTKQESQLRNEVAILQSLRHPGIVNLECMFETP 
EKVFVVMEKLKGDMLEMILSSEKGRLPERLTKFLITQILVA.LRHLHFKNIVHCDLKPENV 
LLASADPFPQVKLCDFGFARI IGEKSFRRSWGTPAYLAPEVLLNQGYNRSLDMWSVGVI 
MYVSLSGTFPFNEDEDINDQIQNAAFMYPASPWSHISAGAIDLINNLLQVKMRKRYSVDK 
SLSHPWLQEYQTWLDLRELEGKMGERYITHESDDARWEQFAAEHPLPGSGLPTDRDLGGA 
CPPQDHDMQGLAERISVL 

SEQ ID NO: 131_AA127299_H 

IQFI I VGAKDLLAMDSNGLSDPYI KITNLSQKTKVI KKTLTPTWNETFFVHFPEPCTTLEJ , 
ECWDHDTFSDDFIGKAS I SLAEI PAIiAEVDMWIDMKTKKGEFAGK

FIGURE 1C



SEQ ID NO: 132_AA316804_H 

MSANNSPPSAQKSVLPTAI PAVLPAAS PCSSPKTGLSARLSNGSFSAPSLTNSRGS VHTV 
SFLLQIGLTRESVTIEAQELSLSAVKDLVCSIVYQKFPECGFFGKYDKILLFRHDMNSEK 
ILQLITSADEIHEGDLVEWLSALATVEDFQIRPHTLYVHSYKAPTFCDYCGEMLWGLVR 

QGLKCEGCGLNYHKRCAFKIPNNCSGVRKRRLSNVSLPGPGI^SVPRPLQPEYVALPSEES 
HVHQEPSKRIPSWSGRPIWMEKMVMCRVKVPHTFAVHSYTRPTICQYCKRLLKGLFRQGM 
QCKDCKFNCHKRCASKVPRDCLGEVTFNGEPSSLGTDTDIPMDIDNNDINSDSSRGLDDT 

EEPSPPEDKMFFLDPSDLDVERDEEAVKTI SPSTSNNI PLMRWQS I KHTKRKSSTMVKE 
GWMVHYTSRDNLRKRHYWRLDSKCLTLFQNESGSKYYKEI PLSEILRI SSPRDFTNI SQG 
SNPHCFEIITDTMVYFVGENNGDSSHNPVLAATGVGLDVAQSWEKAIRQALMPVTPQASV 
CTSPGQGKDHKDLSTSISVSNCQIQENVDISTVYQIFADEVLGSGQFGIVYGGKHRKTGR 
DVAIKVIDKMRFPTKQESQLRNEVAILQNLHHPGIVNLECMFETPERVFWMEKLHGDML 
EMILSSEKSRLPERITKFMVTQILVALRNLHFKNIVHCDLKPENVLLASAEPFPQVKLCD 
FGFARI IGEKSFRRSWGTPAYLAPEVLRSKGYNRSLDMWSVGVI I YVSLSGTFPFNEDE 
DINDQIQNAAFMYPPNPWREISGEAIDLINNLLQVKMRKRYSVDKSLSHPWLQDYQTWLD 
LREFETRIGERYITHESDDARWEIHAYTHNLVYPKHFIMAPNPDDMEEDP 

SEQ ID NO: 133_PKNBETA_H

MEEGAPRQPGPSQWPPEDEKEVIRRAIQKELKIKEGVENLRRVATDRRHLGHVQQLLRSS 
NRRLEQLHGELRELHARILLPGPGPGPAEPVASGPRPWAEQLRARHLEALRRQLHVELKV 
KQGAENMTHTCASGTPKERKLLAAAQQMLRDSQLKVAL.LRMKISSLEASGSPEPGPELLA 
EELQHRLHVKAAVAEGAKNVVKLLSSRRTQDRKALAEAQAQLQESSQKLDLLRLALEQLL 
EQLPPAHPLRSRVTRELRAAVPGYPQPSGTPVKPTALTGTLQVRLLGCEQLLTAVPGRSP 
AAAliASSPSEGWLRTKAKHQRGRGELASEVLAVLKVDNRWGQTGWGQVAEQSWDQTFVI 
PLERARELEIGVHWRDWRQLCGVAFLRLEDFLDNACHQLSLSLVPQGLLFAQVTFCDPVI 
ERRPRLQRQERIFSKRRGQDFLRRSQMNLG^4AAWGRLVMNLLPPCSSPSTISPPKGCPRT 
PTTLREASDPATPSNFLPKKTPLGEEMTPPPKPPRLYLPQEPTSEETPRTKRPHMEPRTR 
RGPSPPASPTRKPPRLQDFRCLAVLGRGHFGKVLLVQFKGTGKYYAIKALKKQEVLSRDE 
IESLYCEKRILEAVGCTGHPFLLSLLVCFQTSSHARFVTEFVPGGDLMMQIHEDVFPEPQ 
ARFYVACWLGLQFLHEKKI I YRDLKLDNLLLDAQGFLKIADFGLCKEGIGFGDRTSTFC 
GTPEFLAPEVLTQEAYTQAVDWWAIjGVLLYEMLVGECPFPGDTEEEVFDCI VNMDAPYPG 
FLSVQGLEFIQKLLQKCPEKRLGAGEQDAEEIKVQPFFRTTNWQALLARTIQPPFVPTLC 
GPADLRYFEGEFTGLPPALTPPAPHSLLTARQQAAFRDFDFVSERFLEP 

SEQ ID NO: 134_AI021023_M PKNBETA_M 

LKWDNLLLDAQGFLKIADFGLCKEGIGFGDRTSTFCGTPEFLAPEVLTQEAYTRAVDWWG 
LGVLLYEMLVGECPFPGDTEEEVFDCIVNMDAPYPGFLSVQGLEFIQKLLQKCPEKJ^LGA 
GEQDAEEIKVQPFFRTTNWQALLARTIQPPFVPTLCGPADLRYFEGEFTGLPPALTPPAP 

HSLLTARQQAAFRDFDFVSERFLEP 
SEQ ID NO: 135_H19102_K

GGNIRGPWARGWKSLWTGLGTIRSDLEELWELRGHHYLHQESLKPAPVLVEKPLPEWPVP 
QFINLFLPEFPIRPIRGQQQLKILGLVAKGSFGTVLKVLDCTQKAVFAVKWPKVKVLQR 
DTVRQCKEEVSIQRQINHPFVHSLGDSWQGKRHLFIMCSYCSTDLYSLWSAVGCFPEASI 
RLFAAELVLVLCYLHDLGIMHRDVKMENILLDERGHLKLTDFGLSRHVPQGAQAYTICGT 
LQYMAPEVLSGGPYNHAADWWSLGVLLFSLATGKFPVAAERDKVAMLASVTHSDSE I PAS 
LNQGLSLLLHELLCQNPLHRLRYLKHFQVHPFFRGVAFDPELLQKQPVNFVTETQATQPS 

SAETMPFDDFDCDLESFLLYPI FA 
FIGURE 1D

SEQ ID NO: 136_A.\47c5C3_H

me?fridskdsasellgldfgeklyslkse?lkpfftlpdgdsasrsfntsesk\^ef:<-^- 
dtisrgsddsvpvisfrxaafddvsgtdegrpdllvnlpgelestreaaamgptkftqtk 
igi ienklleapdvlclrlsteqcqaheekgieelsdpsgpksysitekhyaqedprmlf 
vaavdksssgdmsllpssdpkfqglgvvesavtannteeslfri csplsganeyi astdt 

LKTEEVIjLFTDQTDDLAKEEPTSLFQRDSETKGESGLVLEGDKEIHQI FEDLDKKLlALAS 
RFYIPEGCIQRWAAEMWALDALHREGIVCRDLNPNNILLNDRGHIQLTYFSRWSEVEDS 
CDSDAIERKYCAPEVGAITEETEACDWWSLGAVLFELLTGKTLVECHPAGINTHTTLNMP 
ECVSEEARSLIQQLLQFNPLERLGAGVAGVEDIKSHPFFTPVDWAELMR 

SEQ ID NO: 137_AA626690_H

MLPFAPQDEPWDREMEVFSGGGASSGEVNGLK>1VDEP 

VKEGYEKADPAQFELLK^LGQGSFGKVFLVRKKTGPDAGQLY^ 

KMERDILVEVNHPFIVKLHYAFQTEGKLYLILDFLRGGDVFTRLSKEVLFTEEDVKFYLA 
ELALAIJDHLHQLGIVYRDLKPENILLDEIG 

APEVVNRRGHSQSADWWSYGVLMFEMLTGTLPFQGKDRNET^^ 

QSLLRMLFKRNPANRLGSEGVEEIKRHLFFANIDWDKLYKREVQPPFKPASGKPDDTFCF 
DPEFTAKTPKDS PGLPASANAHQLFKGFSFVATS I AEEYKITPITSANVLPI VQINGNAA 
QFGEVYELKEDIGVGSYSVCKRCIHATTNMEFAVKIIDKSKRDPSEEIEILMRYGQHPNI 
ITLKDVFDDGRYVYLVTDLMKGGELLDRILKQKCFSEREASDILYVISKTVDYLHCQGW 
HRDLKPSNILYMDESASADSIRICDFGFAKQLRGENGLLLTPCYTANFVAPEVLMQQGYD 
AACDIWSLGVLFYTMI^GYTPFANGPNDTPEEILLRIGNGKFSLSGGNWDNISDGAKDLL 
SHMLHMDPHQRYTAEQILKHSWITHRDQLPNDQPKRNDVSHWKGAMVATYSALTHKTFQ 
PVLEPVAASSLAQRRSMKKRTSTGL 

SEQ ID NO: 138_AA215680_H

MSLVACECLPSPGLEPEPCSRARSQAHVYLEQIRNRVALGVPDMTK^^ 
'™ : '*TERDVSEDYEA^ 

PLSSGASPSAGFSSLRL,RPIRTLSSAVEQLRGCRWGVIE?a/QLVQDPATGGTFVVKSLP 
RCHMVSRERLTI I PHGVPYMTKLLRYFVSEDS I FLHLEHVQGGTLWSHLLSQAHSRHSGL 
SSGSTQERMKLAQLNPHLNLLTPARLPSGHAPGQDRIALEPPRTSPNIjLLtAGEAPSTRPQR 
EAEGEPTARTSTSGSSDLPKAPGGHLHLQARRAGQNSDAGPPRGLTVA/PEGAGPVLGGCG 
RGMDQSCLSADGAGRGCGRATWSVREEQVKQWAAEMLVALEALHEQGVLCRDLHPGNLLL 
DQAGH I RLTYFGQWS EVE PQCCGEAVDNLYS APE VGG I SELTEACDWWSFGSLLYELLTG 
MALSQSHPSGIQAHTQLQLPEWLSRPAASLLTELLQFEPTRRLGMGEGGVSPGLKSHPFFS 
TIQWSKLVG 

SEQ ID NO: 139_SGK_H

MTVKTEAAKGTLTYSRMRGMVAI LIAFMKQRRMGLNDFIQKIANNS YACKHPEVQS I LKI 
SQPQEPELMNANPSPPPSPSQQINLGPSSNPHAKPSDFHFLKVIGKGSFGKVLLARHKAE 
EVFYAVKVLQKKAILKKK£EKHIMSERNVL^ 

GGELFYHLQRERCFLEPRARFYAAEIASALGYLHSLNI VTRDLKPENI LLDSQGHI VLTD 
FGLCKENIEHNSTTSTFCGTPEYLA.PEVLHKQPYDRTVDWWCLGAVLYEMLYGLPPFYSR 
NTAEMYDNI LNKPLQLKPNITNSARHLLEGLLQKDRTKRLGAKDDFMEI KSKVFFSLI NW 
DDLINKKITPPFNPNVSGPNELRHFDPEFTEEPVPNSIGKSPDSVLVTASVKEAAEAFLG 
FSYAPPTDSFL 

SEQ ID NO: 140_AA107515_M

MTVKAEAARSTLTYSRMRG^AILIAFMKQRRMGLNDFIQKIASNTYACKHAEVQSILW^ 
SHPQEPELMNANPS PPPS PSQQI NLGPSSNPHAKPSDFHFLKVIGKGSFGKVLLARHK/^E 
FIGURE 1E



EVFYAVKVLQKKAILKKKEEKHIMSERNVLLKNVKHPFLVGLHFSFOTADKLYFVLDYI N 
GGELFYHLQRERCFLEPRARFYAAEIASALGYLHSLNIVYRDLKPEN'ILLDSQGHI VLTD 
XFQLRRIEHNGTTSTFCGTPEYLAPEVLHKQPYDRTVDWWCLGAVLYEMLYGLPPFYSRi: 
TAEMYDNILNKPLQLKPNITNSARHLLEGLLQKDRTKRLGAP^DDFMETKSKl ffslinwd 
DLINKKITPPFNPN\ ; SGPSDLRHFDPEFTEEPVPSSIGRSPDSILVTASVKEAAEAFLGF 
SYAPPVDSFL 

SEQ ID NO: 141_AA109508_M 

HLQRERRFLEPRARFYAAEVASAIGYLHSLNIIYRDLKPENILLDCQGHWLTDFGLCKE 
GVEPEDTTSTFCGTPEYLAPEVLRKEPYDRAVDWWCLGAVLYEMLHGLPPFYSQDVSQMY 
ENILHQPLQIPGGRTVAACDLLQSLLHKDQRQRLGSKADFLEIKNHVFFSPINWDDLYHK 
RLTPPFNPNVTGPADLKHFDPEFTQEAVSKSIGGTPDTVASSSGASSAFLGFSYAPEDDD 

ILDC 

SEQ ID NO: 142_AA887783_H 

MQRDHT^YKESCPSVXIPSSDEHREKKKRFTWK^^ 

LKKQFPAXALKIPAKRIFGDNFDPDFIKQRRAGLNEFIQNLVRYPELYNHPDVRAFLQMD 
SPKHQSDPSEDEDERSSQKLHSTSQNINLGPSGNPHAKPTDFDFLiKVIGKGSFGKVLLAK 
RKLDGKFYAVKVLQKKI VLNRKEQKHI MAERNVLLKNVKHPFLVGLHYSFQTTEKLYFVL 
DFVNGGEGHVVLTDFGLCKEGIAISDTTTTFCGTPEYLAPEVIRKQPYDNTVDWWCLGAV 
LYEMLYGLPPFYCRDVAEMYDNI LHKPLSLRPGVSLTAWS I LEELLEKDRQNRLGAKEDF 
LEIQNHPFFESLSWADLVQKKI PPPFNPNVAGPDDIRNFDTAFTEETVPYSVCVSSDYS I 
VNAS VLEADDAFVGFS YAPPS EDLFL 

SEQ ID NO: 143_R47805_H 

MAHQTGIHATEELKEFFAKARAGSVRLIKWIEDEQLVLGASQEPVGRWDQDYDRAVLPL 
LDAQQPCYLLYRLDSQNAQGFEWLFLAWSPDNSPVRLKMLYAATRATVKKEFGGGHI?CDE 
LFGTVKDDLSFAGYQKHLSSCAAPAPLTSAERELQQIRINEVKTEISVESKHQTLQGLAF 
PLQPEAQRALQQLKQKJWNYIQMKLDLERETIELVHTEPTDVAQLPSRVPRDAARYHFFL 
YKHTHEGDPLESWFIYSMPGYKCSIKERMLYSSCKSRLLDSVSQDFHLEIAKKIEIGDG 
AELTAEFLYDEVHPKQHAFKQAFAKPKGPGGKRGHKRLIRGPGENGDDS 

SEQ ID NO: 144_H60215_H 

MSKLRMKRRASDRGAGETSARAKALGSGISGNNAKRAGPFILGP 

KDGTDDFYQLKILTLEERGDQGIESQEERQGKMLLHTEYSLLSLLHTQDGWHHHGLFQD 
RTCEIVEDTESSRIWKKMKKRICLVL^ 

VI F YD WRWEALHQKNI VHRDLKLGNMVLNKRTHRI T I TNFCLGKHLVSEGDLLKDQRG 
SPAYISPDVLSGRPYRGKPSDMWALGVVLFTMLYGQFPFYDSI PQELFRKI KAAEYTI PE 
DGRVSENTVCLIRKLLVLDPQQRLAAADVLEALSAI IASWQSLSSLSGPLQWPDIDDQM 
SNADSSQEAKVTEECSQYEFENYMRQQLLLAEEKSSIHDTRSWVPKRQFGSAPPVRRLGH 

DAQPMTS LDTA I LAQRYLRK 
SEQ ID NO: 145_SGK324_H 

MASTRSIELEHFEERDKRPRPGSRRGAPSSSGGSSSSGPKGNGLIPSPAHSAHCSFYRTR 
TLQALSSEKKAKKARFYRNGDRYFKGLVFAISSDRFRSFDALLIELTRSLSDNVNLPQGV 
RTIYTIDGSRKVTSLDELLEGESYVCASNEPFRKVDYTKNINPNWSVNIKGGTSRALAAA 
SSVKSEVKESKDFI KPKLVTVI RSGVKPRKAVRILLNKKTAHS FEQVLTDI TEAI KXASG 
WKRLCTLDGKQVRVTCVHLPDFFGDDDVFIACGPEKFRYAQDDFVLDHSECRVLKSSYS 
RSSAVKYSGSKS PGPSRRSQI SAHGRSSSNVNGGPELDRC I S PEGVNGNRCSESSTLLEK 
YKIGKVI GDGNFAWKEGIDRSTGKEFAL.KI IDKAKCCGKEHLI ENEVS I LRRVKHPNX I 
FIGURE 1F

M L VE EM E T AT E L F L VN E L VKGGD L FD A ITS S TK Y TE R D G S AMVYN LAN AL R Y LH G L S I VH 

RDIKPENLLVCEYPDGTKSLKLGDFGLATWEGPLYTVCGTPTYVAPXI IAETGYGLKVD 

I WAAGVITYI LLCGFFPFRSENNLQEDLFDQILAGKLEFPAPYWDNI TDSAKELI SQMLQ 

VNVEARCTAGQILSHPWVSDDASQErWMQAEVTGKLKQHFNNALPKONSTTTGVSVIMVS 
GRRQVWPDCGAGLEVFELGSRELPSHGSWCLP 

SEQ ID NO: 146_W3C246_M SGK324_M 

TKSSSSS PTS PGSFRGLKI SAQGRSSSNVNGGPELDRGLS PEGVNGNRCSESFPLLEKYR 
I GKVIGDGNFAWKECVDRYTGKEFALKI I DKAKCCGKEHLI ENEVS I LRRVKHPNI IML 
VEEMETATDLFLVMELVKGGDLFDAITSSTKYTERDGSAMVYNLANALRYLHSLSIVHRD 
I KPENLLVCEYPDGTKSLKLGDFGLATVVEGPLYTVCGTPTYVAPE 1 1 AETGYGLKVDVW 
AAGVITYILLCGFPPFRSENNLQEDLFDQILAGKLEFPAPYWDNITDSPCVCFRKCL 

SEQ ID NO: 147_AA383293_H 

PAAKRVVWRNGDPFFPGSQLWTQRRFPTMEAFLCEVTSAVQAPLAVRALYTPCHGHPV 

TNLADLKNRGQYVAAGFERFHKLPPYQAFCLSVFRNGDLVSPPFSLKLSQAASQDWETVL 

K1,LTE?^KLQSGAWLCTLEGLPLSAGKELVTGHYYVAVGEDEFKDLPYPALSTRGLIJ^ 

GNEAJILRSGVGTVAGSPKPLGRKAKKETCLIVTLTLKYQQSETSRDGQSFPSGVIGVYGA 

PHRRKETAGALEVADDEDTQTEEPLDQRAAQIVEQVTCLQDFFGDDDVFIACGPEKFRYA 

QDDFVLDHSRRRLLREHQAGFEKLRRTRGEEKEAEKEKKPCMSGGRRMTLRDDQPAKLEK 

EPKTRPEENKPERPSGRKPRPMGI IAANVEKHYETGRVIGDGNFAVVKECRHRETRQAYA 

MKIIDKSRLKGKEDMVDSEILIIQSLSHPNIVKLHEVYETDMEIYLILEYVQGGDLFDAI 

IESVKFPEPDAALMIMDLGKALVHMHDKSIVHRDLKPENLLVQRNEDKSTTLKI^ 

KHVVRPIFTVCGTPTYVAPEILSEKGYGLEVDMWAAGVILYILLCGFPPFRSPXXGDQDE 

LFNIIQLGHFEFLPPYVTONISDAAKI)LVSRLLVVDPKXRYTAHQVLQHPWIETAGKTNTV 
KRQKQVS PS SDGHFRSQHKR WEQVS 

SEQ ID NO: 148_AA197883_M

MPTAPVLRPPPPPATPAPPAPSRPAPPI PGHRGPCDHSLKGLSSKI SERKLPGPWLPAGR 
GPLEKPVLGPRGAVMPLFSPQSSLHSVRAEHSPLKPRVVTVVKX.GGQPLR 
VQTFEQLLSDISEALGFPRWKNDRVRKLiFTLKGREVKSVSDFFREGDAFI AMGKEPLTLK 
SIQLAJ^EELYPKNRAIJ^I^PHSRVPSPRLRSRLPSKljLKGSHRCG 

RHQGKTSTVLAPEDKARAQKWVRGKQESEPGGPPSPGAATQEETHASGEKHLGVEIEKTS 
GEIVRCEKCKRERELQLGLQREPCPLGTSELDLGRAQKRDSEKLVRTKSCRRPSKAKFTD 
GEEGWKGDSHRGSPRDPPQEMRRPNSNSDKKEIRGSESQDSYPQGAPKAQKDFVEGPPAV 
EEGPIDMRREDRHTCRSKKAAWLRREQQAEPPQLPRTRGEEKQAEHEKKPGGLGERRAPE 
K^SKRKLEEKRPERPSGRKPRPKGIISADVEKHYDIGGVIGDGNFATVKECRHRETKQAY 
AMKMIDKSQLKGKEDI VDSEILI IQSLSHPNI VKLHEVYETEAEI YLIMEYVQGGDLFDA 
I VENVKFPEPEIAAVM I TDLCKAFVHMHDKNI VHRDVKPENLLVQRNEDKS I TLKLiADFGL 
AKYWRPIFTVCGTPTYVAPEILSEKGYGLEVDMWAAGVILYILLCGFPPFRSPERDQDE 
LFNI IQVGQFEFLSPYWDNI SDAAKDL\TRNLLEVDPKKJ^YTAEQVLQHPWI EMVGHTNTG 
NSQKEESPNSLGHFQSQHKKVAEQMP 

SEQ ID NO: 149_DRAK2_H

MSRRRFDCRS ISGLLTTTPQI PIKMENFNNFYILTSKELGRGKFAWRQCISKSTGQEYA 
AKFLKKRRRGQDGRAEILHEIAVLELAKSGPRVINLHEVYENTSEIILILEYAAGGEIFS 
LCLPELAEMVSE^VIRLIKQILEGVYYLHQNNIVHLDLKPQNILLSSI YPLGDIKIVDF 
GMSRKIGHACELREIMGTPEYLAPEI LNYDPITTATDMWNI GI I AYMLLTHTS PFVGEDN 
QETYLNI SQVNVTDYSEETFSSVSQLiATDFI QSLLVKNPEPCRPTAE I CLSHSWLQQWDFEN 
FIGURE 1G

LFHPEETSSSSQTQDHSVRSSEDKTSKSSCNGTCGDREDKENIPEDSSMVSKRFRFDDSL 
PNPHELVSDLLC 

SEQ ID NO: 150_W44160_M DRAK2_M

MSRRRFDCRSVSGLLTTTPQTPIKTENFNNFYTLTPKELGRGKFAWRQCISKSTGQEYA 
AKSLKKRRRGQDCRAEILHEIAVLELARSCPHVINLHEVYENATEI ILVLEYAAGGEIFN 
LCLPELA£MVS V I RL I KQ I LEGW 

GMSRKI GNASELRE IMGTPEYLAPE I LNYDP I TTATDMWNI G 1 I AYMLLTHTS PFVGEDN 
QETYI^ISQVNVDYSEEMFSSVSQLATDFIQSLLVKNPEKRPTAESCLSHSWLQQWDFGS 

LFHPEETSGSSQIQDLTLRSSEEKTSKSSCNGSCGAREDKENIPEDGSLVSKRFRFDDSL 
PSPHELVPDLFC 

SEQ ID NO: 151_H01246_H, DRAK1_H

MIPLEKPGSGGSSPGATSGSGRAGRGLSGPCRPPPPPQARGLLTEIRAWRTEPFQDGYS 
LCPGRELGRGKFAWRKCIKKDSGKEFAAKFMRKRRKGQDCRMEIIHEIAVLELAQDNPW 
VINLHEVYETASEMILVLEYAAGGEIFDQCVADREEAFKEKDVQRLMRQI LEGVHFLHTR 

DWHLDLKPQNILLTSESPLGDIKIVDFGLSRILKNSEELREIMGTPEYVAPEILSYDPI 
SMATDMWS I GVLTYVMLTG I SPFLGNDKQETFLN I SQHNLSYSEEEFDVLSESAVDF I RT 
LLVKKPEDRATAEECLKHPWLTQSSIQEPSFRMEKALEEANALQEGHSVPEINSDTDKSE 
TEESIVTEELI VVTSYTLGQCRQSEKEKMEQKAI SKRFKFEEPLLQEIPGEFI Y 

SEQ ID NO: 152_AA021445_H 

M PAR I GYYE I DRT I GKGNFA WKRATHLVTKAKVA I KI I DKTQLDEENLKKI FREVQ IMK 

MLCHPHI IRLYQVMETERMI YLVTEYASGGEI FDHLVAHGRMAEKEARRKFKQIVTAVYF 

CHCRNIVHRDLKAENLLLDANLNIKIADFGFSNLFTPGQLLKTWCGSPPYAAPELFEGKE 

YDGPKVDIWSLGVVLYVLVCGALPFDGSTLQNLRARVLSGKFRIPFFMSTECEHLIRHNL 

VLDPNKRLSMEQICKHKWMKLGDADPNFDRLIAECQQLKEERQVDPLNEDVLLAM 

DKEQTLQSLRSDAYDHYSAIYSLLCDRHKRHKTLRLGAXPSMPRALAFQAPVNIQAEQAG 

TAMNISVPQVQLINPENQIVEPDGTLNLDSDEGEEPSPEALVRYLSMRRHTVGVADPRTE 

VMEDLQKLLPGFPGVNPQAPFLQVAPNVNFMHNLLPMQNLQPTGQLEYKEQSLLQPPTLQ 

LliNGMGPLGRRASDGGANIQLHAQQLLKRPRGPSPLVTMTPAVPAVTPVDEESSDGEPDQ 

EAVQRYLANRSKRHTLAMTNPTAEIPPDLQRQ 

TLHLPTERFS PVRRFSDGAAS IQAFKAHLEKMGNNSS I KQLQQECEQLQKMYGGQIDERT 

LEKTQQQHMLYQQEQHHQILQQQIQDSICPPQPSPPLQAACENQPALLTHQLQRLRIQPS 

SPPPNHPNNHLFRQPSNSPPPMSSAMIQPHGAASSSQFQGLPSRSAIFQQQPENCSSPPN 

VAIjTCLGMQQPAQSQQVTIQVQEPVDMLSNMPGTAAGSSGRGISISPSAGQMQMQHRTNL 

MATLSYGHRPLSKQLSADSAEAHSLNVNRFSPANYDQAHLHPHLFSDQSRGSPSSYSPST 

GVGFSPTQALKVPPLDQFPTFPPSAHQQPPHYTTSALQQALLSPTPPDYTRHQQVPHILQ 

GLLSPRHSLTGHSDIRLPPTEFAQLIKRQQQQRQQQQQQQQQQEYQELFRHMNQGDAGSL 

APSLGGQSMTERQALSYQNADSYHHHTSPQKLLQIRAQECVSQASSPTPPHGYAHQPALM 

HSESMEEDCSCEGAKDGFQDSKSSSTLTKGCHDSPLLLSTGGPGDPESLLGTVSHAQELG 

IHPYGHQPTAAFSKNPCVPSREPVIGNCMDRSSPGQAVELPDHNGLGYPARPSVHEHHRPR 

ALQRHHTIQNSDDAYVQLDNLPGMSLVAGKALSSARMSDAVLSQSSLMGSQQFQDGENEE 

CGASLGGHEHPDLSDGSQHLNSSGYPSTCITDILLSYKHPEVSFSMEQAGV 

SEQ ID NO: 153_2R22-5-11_H

MTAVTMNGGGLVNPHYARWDRRJDSVESGCQTESSKEGEEGQPRQLTPFEKLTQDMSQDE?: 
VVREITLGKRIGFYRIRGEIGSGNFSQVKLGIHSLTKEKVAIKILDKTKLDQKTQRLLSR 
EISSMEKLHHPNIIRLYEVVETLSKLHLVMEYAGGGELFGKISTEGKLSEPESKLIFSQI 
VSAVKHMHENQIIHRDLKAENVFYTSNTCVPCVGDFGFSTVSKKGEMLNTFGGSPPYAAPE 
FIGURE 1H



LFRDEHYIGI Y\TDI WALGVLXYFMVTGTMPFRAETVAKLKKS I LEGTYSVPPHVSEPCHR 
LIRGVLQQI PTERYGIDCIMNDEWMQGVPYPTPLEPFQLDPKKLSETSTLKEEENEVKST 
LEHLGITEEHIRNNQGRDARSSITGVYRI I LHRVQRKKALES VPVMMLPDPKERDLKKG £ 
R VYRG I RHTSKFCS I L 

SEQ ID NO: 154_R31237_1_H, AAC33487

MSTRTPLPTVNERDTENHTSHGDGRQEVTSRTSRSGARCRNSIASCADEQPHIGNYRLLK 
TIGKGNFAKTVKLARHI LTGREVAI KI IDKTQLNPTSLQKLFREVRIMKI LNHPNI VKLFE 

VIETEKTLYLIMEYASGGEVFDYLVAHGRMKEKEARSKFRQI VSAVQYCHQKRI VHRDLK 
AENLLLDADMNIKIADFGFSNEFTVGGKLDTFCGSPPYAAPELFQGKKYDGPEVDVWSLG 
VI LYTLVSGSLPFDGQNLKELRERVLRGKYRI PFYMSTDCENLLKRFLVLNPI KRGTLEQ 
IMKDRWINAGHEEDELKPFVEPELDISDQKRIDIMVGMGYSQEEIQESLSKMKYDEITAT 
YLLLGRKSSELDASDSSSSSNLSLAKVRPSSDLNNSTGQSPHHKVQRSVSSSQKQRRYSD 
HAGPAIPSWAYPKRSQTSTADGDLKEDGISSRKSSGSAVGGKGIAPASPMLGNASNPNK 
ADI PERKKSSTVPSSNTASGGMTRRNTYVCSERTTADRHSVIQNGKENSTI PDQRTPVAS 
THS I SSAATPDRIRFPRGTASRSTFHGQPRERRTATYNGPPASPSLiSHEATPLSQTRSRG 
STNLFSKLTSKLTRSRNVSAEQKDENKE^ 

ANNCDYEQRERFLLFCVHGDGHAENLVQWEMEVCKLPRLSLNGVRFKRISGTSIAFKNIA 
SKI ANELKL 

SEQ ID NO: 155_W90839_M

KGPSWSSRSLGARCRNSIASCPEEQPHVGNYRLLRTIGKGNFAK^KLARHILTGREVAIK 
IIDKTQL^PSSLQKLFREVRIMKGI^PNIVKLFEVIETEKTLYLVMEYASAGEVFDYLV 

SHGRMKEKEARAKFRQIVSAVHYCHQKNIVHRDLKAENLLLDAEANIKIADFGFSNEFTL 
GSKLDTFCGSPPYAAPELFQGKKYDGPEVDIWSLGVILYTLVSGSLPFDGHNLKELRERV 
LRGKYRVPFYMSTDCES I LRRFLVLNPAKRCTLEQI MKDKW INI GYEGEELKPDTELKEE 
RMPGRKASCSAVGSGSRGLPPSSPMVSSAHNPNKAEIPERRKDSTSTPNNLPPSMMTRRN 
TYVCTERPGSERPSLLPNGK^NSSGTSRVPPASPSSHSLAPPSGERSRIoARGSTIRSTFH 
GGQVRDRRAGSGSGGGVQNGPPASPTLAHEAAPLPSGRPRPTTNLFTKLTSKLTRRVTDE 

periggpevtschlpwdktetaprllrfpwsvkltssrps 
SEQ ID NO: 156_406786.5_H

mevggltvfeedqrclsqslplpvsaegpaaqttaepsrsfssahrhlsrrnglsrlcqs 
rtalsedrwssyclsslaaqnictsklhcpaapehtdpseprgsvsccsllrglssgwss 
pllpapvcnpnkaiftvdaktteilvand^ 

alseehmeadghaawfgtvvdi itrsgeki pvsvwmkrmrqerrlccvwlepvervst 
va/afqsdgtitscdslfahlhgyvsgedvagqhitdlipsvqlppsgqhipknlkiqrsv 
graridgttfplslklksqpsseeattgeaapvsgyrasvwvfctisglitllpdgtihgi 
nhsfaltlfgygktellgknitflipgfysymdlaynsslqlpdlascldvgnesgcger 
tldpwqgqdpaeggqdprinvvlagghwprdeirklmesqdiftgtqteliaggqllsc 
lspqpapgvdnvpegslpvhgeqalpkdqqitalgreepvaiespgqdllgesrsepvdv 

kpfascedseapvpaedggsdagmcglcqkaqlermgvsgpsgsdlwagaavakpqakgo 
laggsllmhcpcygsewglwwrsqdlapspsgmaglsfgtptldepwlgvendreelqtc 
likeqlsqlslagaldvphaelvptecqavtapvsscdlggrdlcggctgsssacyalat 
dlpggleaveaqevdvnsfswnlkelffsdqtdqtssncscatselretpsslavgsdpd 
vgslqeqgscvlddrelllltgtcvdlgqgrrfrescvghdpteplevclvssehyaasd 
respghvpstldagpedtcpsaeeprlnvqvtstpvivmrgaaglqreiqegaysgscyh 
rdglrls iqfevrrvelqgptplfccwlvkdllhsqrdsaartrlflaslpgsthstaae 
ltgpslvevlrarpwfeeppkaveleglaacegeysqkystmsplgsgafgfvwtavdkg 
kn?:e\^/kfikkekvledcv;iedp?clgk\^tleiailsrvehani ikvldifenqgffqlv 
FIGURE 1I

MEKHGSGLDLFAFIDRHPRLDEPLASYIFRQVRAGQSRLVSAVGYLRLKDI IHRDIKDEK 
I VIAEDFTIKLIDFGSAAYLERGKLFYTFCGTIEYCAPEVLMGNPYRGPELEMWSLGVTIj 
YTLVFEENPFCELEETVEAAIHPPYLVSPCELMSLVSGLLQPVPERRTTLEKLVTDPWVTQ 
PVNLADYTWEEVFRVNKPESGVLSAASLEMGNRSLSDVAQAQELCGGPVPGEAPNGQGCL 
HPGDPRLLTS 

SEQ ID NO: 157_AA544838_M 406786_M 

TRPHPCLDEPLASFIFRQLVSAVGYLHSQGI IHRDIKDENI VIAEDFTI KLIDFGSAAYL 
ERGKLFYTFCGTIEYCAPEVLIGNPYRGPELEMWSLGVTLYTLIFEENPFCEVEETMEAV 
IHPPFLVSQELMSLLSGLLiQPCPEQRTTLEKLTRDPWVTQPVNLASYTWEEVCRTNQPES 
GLLSAASLEIGSRSPSEMAQREGLCGPPAPRETRGDQHCLHLKDPSLPVS 

SEQ ID NO: 158_AA785735_H

KVMAIDGPRHLQRGPVRVGFYDIEGTLGKGNFAVVKIjGRHRITKTEVAIKI idksqldavn 
LEKI YREVQ I MKMLDHPH 1 1 KLYQVMETKSMLYLVTEYAKNGE I FD YLANHGRLNE S EAR 
RKFWQILSAVDYCHGRKI VHRDLKAENLLLDNNMNIKIADFGFGNFFKSGELLATWCGSP 
PYAAPEVFEGQQYEGPQLDIWSMGWLYVLVCGALPFDGPTLPILRQRVLEGRFRIPYFM 
SEDCEHLIRRMLVLDPSKRLTIAQIKEHKWMLIEVPVQRPVLYPQEQENEPSIGEFNEQV 
LRLMHSLGIDQQKXIESLQNKSYNHFAAIYFLLVERLKSHRSSFPVEQRLDGRQRRPSTI 
AEQTVAKAQTVGLPVTMHSPNMRLLRSAL^ 

PKVNGCLLDPVPPVLVRKGCQSLPSNMMETSIDEGLETEGEAEEDPAHAFEAFQSTRSGQ 

RRHTLSEVTNQLVVMPGAGKIFSMNDSPSLDSVDSEYDMGSVQRDLNFIjE^ 

Ai^QPSPRMTSPFISLRPTNPAMQALSSQKREVHNRSPVSFREGRRASDTSLTQGIVAFRQ 

HLQNIJ\RTKGILELNKVQLLYEQIGPEADPNLAPAAPQLQDLASSCPQEEVSQQQESVST 

LPASVHPQLSPRQSLETQYLQHRLQKPSLLSKAQNTCQLYCKEPPRSLEQQLQEHRLQQK 

RLFLQKQSQLQAYFNQMQIAESSYPQPSQQLPLPRQETPPPSQQAPPFSLTQPLSPVLEP 

SSEQMQYSPFLSQYQEMQLQPLPSTSGPRAAPPLPTQLQQQQPPPPPPPPPPRQPGAAPA 

PLQFSYQTCELPSAASPAPDYPTPCQYPVDGAQQSDLTGPDCPRSPGLQEAPSSYDPLAL 

S ELPGLFDCEMLDAVDPQHNGYVLVN 

SEQ ID NO: 159_AA207220_H

MESLVFARRSGPTPSAAELARPKAEGLIKSPKPLMKKQAVKRHHHKHNLRHRYEFLETLG 
KGT YGKVKKARE S SGRLVA I KS I RKDKI KDEQDLMH I RRE IEIMSS LNH PH 1 1 A I HE VFE 
NSSKI VIVMEYASRGDLYDYI SERQQLSEREARHFFRQI VSAVHYCHQNRVVHRDLKLEN 
ILLDANGNIKIADFGLSNLYHQGKFLQTFCGSPLYASPEIVNGKPYTGPEVDSWSLGVLL 
YILVHGTMPFDGHDHKILVKQISNGAYREPPKPSDCLXGLIRWLLMVNPTRRATLEDVAS 
HWWVNWGYATRVGEQEAPHEGGHPGSDSARASMADWLRRSSRPLLENGAKVCSFFKQHAP 
GGGSTTPGLERQHSLKKSRKENDMAQSLHSDTADDTAHRPGKSNLKLPKGILKKKVSASA 
EGVQEDPPELSPI PASPGQAAPLLPKKGI LKKPRQRESGYYSSPEPSESGELLDAGDVFV 
SGDPKEQKPPQASGLLLHRKGILKLiNGKJFSQTALELAAPTTFGSLDEIxAPPRPLARASRP 
SGAVSEDSILSSESFDQLDLPERLPEPPLRGCVSVDNLTGLEEPPSEGPGSCLRRWRQDP 
LGDSCFSLTDCQEVTATYRQALRVCSKLT 

SEQ ID NO: 160_AA426580_H, MAK_V_H

MPAAAGDGLLGEPAAPGGGGGAEDAARPAAACEGSFLPAWVSGVPRERLRDFQHHKRVGN 
YLIGSRKLGEGSFAKVREGLHVLTGEKVAI KVIDKKRAJCKDTYVTKNLRREGQIQQMIRH 
PNITQLLDILETENSYYLVMELCPGGNLMHKIYEKKRLEESEARRYIRQLISAVEHLHRA 
GWHRDLKIENLLLDEDNNI KLIDFGLSNGAGI LGYSDPFSTQCGS PAYAAPELLARKKY 
GPKIDVWSIGVNMYAMLTGTLPFTVEPFSLRALYQKKVDKEMNPLPTQLSTGAISFLRSL 
LEPDPVKRPNIQQALANRWLNENYTGKVPCNVTYPNR I SLEDLS PSWLHMTEKLGY^S 
FIGURE 1J

DYINTYLSNTRACKILAI YFLLNKKLERYLSGKSDI QDSLCYKTRLYQI EKYRAPKESYEA 
S LDT WT RD L E FHA VQ D KK ? KE Q E KR G D F LH R P F S KKLD KN L P S H KQ ? S G S LMT Q 1QK T KA 
LLKDRKASKSSFPDKDS FGCRN I FRKTSDSNCVASSSMEFI PVP PPRTPRI VKKPEPHQF 
GPGSTG Z PHKEDPLMLDMYRSFESYDRDDHVEVLS PSHHYRI LNSPVSLARRXSSERTLS 
FGLPSGSMS PLHTPLHPTLVSFAFIEDKNS PPKEEGLCCPPPVPSNGPMQPLGS PNCVKSR 
GRFPMMG IGQKLRKRHQSLQPSADRFLEASLPPLQPIAPVNI^FDMADGVKTOG 

SEQ ID NO: 161_Z36720_H 

MDTK1NML.NEKVDQLLHFQEDVTEKLQSMCRDMGHLERGLHRLEASRAPGPGGADGVPHI 
DTQAGWPEVLELV^AMQQDAAQHGARLEALFRMVAAVDRAIALVGATFQKSKVA^ 
RVPWRRGSPGDSPEEWVKEEEVCFMPPVPPAPGAAGQSLQKDKGELSAEQGIWATLMTLV 
I1WTAANK£RVEEEGGKPKHVLSTSGVQS 

ADPAQAWSPGQGDGVPGPAQAFPGHLPLPTKVEAKAPETPSENLRTGLELAPAPGRVNV 
VSPSLEVAPGAGQGASSSRPDPEPLEEGTRLTPGPGPQGPGPPGLPAQARATHSGGETPP 
RAALLKGAVAPGFSRRDLVFPS I FCACLG I S I H I QEMDTPGEMLMTGRGSLGPTLTTEAP 
AAAQPGKQGPPGTGRCLQAPGTEPGEQTPEGARELSPLQESSSPGGVKAEEEQRAGAEPG 
TRPSLARSDDNDHEVGALGLQQGKSPGAGNPEPEQDCAARAPYRAEAVRRMPPGAEAGSY 
VLDDSPAPPAPFEHRWSVKETSISAGYEVGQHEVLGGGRFGQVHRGTEKSTGLPLAAKI 
IKVKSAKDREDVKNEINIMNQLSITVTSILIQLYDAFESK^ 

YHLTELDVVLFTRQICEGVHYLHQHYILHLDLKPENI LGVNQTGHQIKI I DFGLARRYKP 
REKLKVNFGTPEFLAPEVV^EFVSFPTDMWSVGVIT^ 

NCSWDFDADTFEGLSEEAKDFVSRLLVKEKSCRMSATQCLKHEWIJWLPAKASRSKTRLK 
SOLLLQKYIAQRKWKKHFYWTAANRLRKFPTSP 

SEQ ID NO: 162_SGK088_H 

GEMALFECLVAGPTDVE\nDWLCRGRLLQPALLKGKMHFDGRKCKLLLTSVHEDDSGVYTC 
KX,STAKDELTCSARLTVRPSLAPLFTRLLEDVEVLEGRAARFDCKISGTPPPWTWTHFG 
GPMEESENLRLRQDGGLHSLHIAHVGSEDEGLYAVSAVNTHGQAHGSAQLYVEEPRTAAS 
GPSSKLEKMPSIPEEPEQGELERLSIPDFLRPLQDLEVGLAKEAMLECQVTGLPYPTISW 
FHNGHRIQSSDDRRMTQYRDVHRLVFPAVGPQHAGVTKSVIANKLGKAAGYAHLYVTDW 
PGPPDGAPQWAVTGRMVTLTWNPPRSLDMAIDPDSLT^ 

PGWAATGLRKGVQHIFRVLSTTVKSSSKPSPPSEPVQLLEHGPTLEEAPANLDKPDIWV 
VEGQPASVTVTFNHVEAQVVWRSCRGAXfLEARAGV^ELSQP 

LTCTARNRHGTQTGS VTLELAEAPRFES I MEDVEVGAGETARFAVWEGKPLPD I MWYKD 
EVLLTESSHVSFWEENECSLVVLSTGAQDGGVYTGTAQNI^GEVSCKAEl^VHSAQTAM 
EVEGVGEDEDHRGRRLSDFYDIHQEIGRGAFSYLRRI VERSSGLEFAAKFI PSQAKPKAS 
ARREARLLARLQHDCVLYFHEAFERRRGLVIVTELGTEELLERIARKPTVCESEIRAYMR 
QVLEGIHYLHQSHVLHLDVKPENLLVWDGAAGEQQVRICDFGNAQELTPGEPQYCQYGTP 
EFVAPEIVNQSPVSGVTDIWPVGWAFLGLTGISPFVGENDRTTLMNIRNYNVAFEETTF 
LSLSREARGFLIKVLVQDRLRPTAEETLEHPWFKTQAKGAEVSTDHLKLFLSRRRWQRSQ 
I SYKGHLVLRPI PELLRAPPERVWVTMPRRPPPSGGLSSSSDSEEEELEELPSVPRPLQP 
EFSGSRVSLTDI PTEDEALGTPETGAATPMDWQEQGRAPSQDQEAPSPEALPSPGQEPAA 
GASPRRGELRRGSSAESALPRAGPRELGRGLHKAASVELPQRRSPGPGATRLARGGLGEG 
EYAQRLQALRQRLLRGGPEDGKVSGLRGPLLESLGGRARDPRMARAASSEAAPHHQPPLE 
NRGLQKSSSFSQGEAEPRGRHRRAGAPL.EI PVAJRLGARRLQESPSLSALSEAQPSSPARP 
SAPKPSTPKSAEPSATTPSDAPQPPAPQPAQDKAPEPRPEPVRASKPAPPPQALQTLALP 
LTPYAQI IQSLQLSGHAQGPSQGPAAPPSEPKPHAAVFARVASPPPGAPEKRVPSAGGPP 
VLAEKARVPTVPPRPGSSLS SSI ENLESEAVFEAKFKRSRESPLSLGLRLLSRSRSEERG 
PFRGAEEEDGI YRPSPAGTPLELVRRPERSRSVQDLRAVGEPGLVRRLSLSLSQRLRRTP 
PAQRHPAWEARGGDGESSEGGSSARGSPVLtAMRRRLSFTLERLSSRLQRSGSSEDSGGAS 
FIGURE 1K

GRSTPLFGRLRRATSEGESLRRLGLPHNQLAAQAGATTPSAESLGSEASATSGSSAPGES 
RSRLRWGFSRPRKDKGLSPPNLSASVQEELGHQYVRSESDFPPVFHI KLKDQVLLEGEAA 
TLLCLPAACPAPHI SWMKDKKSLRSEPSVI I VSCKDGRQLLS I PRAGKRHAGLYECSATN 
VLGSITSSCTVAVARVPGKI,APPEVTQTYQDTALVLWKPGDSRAPCTYTLERR r /DGESVW 
HPVSSGI PDCYYNVTHLPVGVTVRFRVACAN RAGQGPFSNSSEKVFVRGTQDS SAVPSAA 
HOEAPVTSRPARARPPDSPTSLAPPLAPAAPTPPSVTVSPSSPPTPPSQALSSLKAVGP? 
PQTPPRRHRGLQAARPAEPTLPSTHVTPSEPKPFVLDTGTPIPASTPQGVKPVSSSTPVY 
WTSFVSAPPAPEPPAPEPPPEPTKVTVQSLSPAKEWSSPGSSPRSSPRPEGTTLRQGP 
PQKPYTFLEEKARGRFGWRACRENATGRTFVAKIVPYAAEGKPRVLQEYEVLRTLHHER 
IMSLHEAYITPRYLVLIAESCGNRELLCGLSDRFRYSEDDVATYMVQLLQGLDYLHGHHV 
LHLDIKPDNLLIJ^PDNAJ^KIVDFGSAQPYNPQALRPLGHRTGTLEFMAPEKVKGEPIGSA 
TDIWGAGVLTYIMLSGRSPFYEPDPQETEARIVGGRFDAFQLYPNTSQSATLFLRKVLSV 
HPWSRPSLQDCLAHPWLQDAYLMKLRRQTLTFTTNRLKSFLGEQRRRRAEAATRHK^v/LL^ 

SYPGGP 

SEQ ID NO: 163_AA542015_M SGK088_M

ATDIWGAGVLTYIMLSGYSPFYEPDPQETEARIVGGRFDAFQLYPNTSQSATLFLRKVLS 

VHPWSRPSLQDCLAHPWLQDAYLMKi^ 

RSYPGSP 

SEQ ID NO: 164_R19772_H 

MKGGDRAYTRGPSLGWLFAKCCCCFPCRDAYSHSSSENGGKSESVANL.QAQPSLNFIHSS 
PGPKRSTNTLKKWLTSPVRRL^SGKADGNIKK^^ 

ADESKKGWGEDEPDEESHTPLPPPMKIFDNDPTQDEMSSSLLAARQASTEVPTAADLVNA 
IEKLVKNKLSLEGSSYRGSLKDPAGCLNEGl^^ 

QTEKDYVKDLGIVVEGFMKRIEEKGVPEDMRGKDKIVFGNIHQIYDWHKDFFLJVELEKCI 
QEQDRLAQLFIKHERKLHI YVWYCQNKPRSEYI VAEYDAYFEEVKQEINQRLTLSDFLIK 
PIQRITKYQLLLKDFLRYSEKAGLECSDIEKAVELMCLVPK^ 

AQGKLLQQDTFYVIELDAGMQSRTKERRVFLFEQIVIFSELLRKGSLTPGYMFKRSIKMN 
YLVLEENVDNDPCKFALMNRETSERVVLQAANADIQQAWVQDINQVLETQRDFLNALQSP 
IEYQRKERSTAVMRSQPARLPQASPRPYSSVPAGSEKPPKGSSYNPPLPPLKISTSNGSP 
GFEYHQPGDKFEASKNDLGGCNGTSSMAVIKDYYALKENEICVSQGEVVQVLAVNQQNMC 

LVYQPASDHS PAAEGWVPGS I LAPLTKATAAESSDGS I KKSCSWHTLRMRKRAEVENTGK 
NEATGPRKPKDILGNKVSVKETNSSEESEGDDLDPNTSMEIL^PNFIOEVAPEFLVPLVD 

VTCLLGDTVILQCKVCGRPKPTI TWKGPDQNI L.DTDNS SATYTVSSCDSGE I TLKI CNLM 
PQDSGIYTCIATNDHGTTSTSATVKVQGVPAAPNRPIAQERSCTSVILRWLPPSSTGNGT 
ISGYTVEYREEGSQIWQQSVASTLDTYLVIEDLSPGGPYQFRVSASNPWGISLPSEPSEF 

VRLPEYTDAAADGATI SWKENFDSAYTELNE IGRGRFS I VKKC I HKATRKDVAVKFVNKKM 
KKKEQAAHEAALLQHLQHPQYITLHDTYESPTSYILILELMDDGRLLDYLMNHDELMEEK 
VAFYIRDIMEALQYLHNCRVAHLDIKPENLLIDLRIPVPRVKLIDLEDAVQISGHFHIHH 
LLGNPEFAAPEVIQGI PVSLGTDI WSIGVLTYVMLSGVSPFLDESKEETCINVCRVDFSF 
PHEYFCGVSNAARDFINVILQEDFRRRPTAATCLQHPWLQPHNGSYSKIPLDTSRLACFI 

ERRKHQNDVRP I PNVKS Y I VNRVNQGT 
SEQ ID NO: 165_5R? ?_8_2_H

MADSGLDKKSTKCPDCSSASQKDVLCVCSSKTRVPPVLWEMSQTSSIGSAESLISLERK 
KEKNINRDITSRKDLPSRTSNVERKASQQQWGRGNFTEGKVPHIRIENGAAIEEIYTFGR 
I LGKGSFGI VI EATDKETETKWAJ KKVNKEKAGSSAVKLLEREVNI LKSVKHEHI I HLSC 
VFETPKKMYLVMELCEDGELKEI LDRKGHFSENETRWI IQSLASAI AYLHNNDI VHRDLK 
LENIMVKSSLIDDNNEINIJ^IKVTDFGLiAVKKQSRSEAMLOATCGTPIYMAPEVISAHDY 
FIGURE 1L

sqqcdi v;sig\^iymllrgeppflj\ssea?;lfelirkgelhfenawnsisdca:<sv^kc 
lmkvdpahritakelldnqwltgn^ 

teekljksyopvjgnvpetn^tsdeeeekqstayekqfpatskdnfdkcsssftsskllfai: 
ikgemektpvtpsqgtatkypaksgalsrtkkkl 

SEQ ID NO: 166_SGK309_H

mq c laaal kd etnms gggeq ad i l p anywkdr w kvl kkiggggfgei yeandlltremv 

ALKVESAQQPKQVLKMEVAVIjKXLQGSGLGQGDGKEEMMKPGAK^ 

KFNYVVMQLQGRNLADLRRSQPRGTFTLSTTLRLGKQILESIEAIHSVGFLHRDIKPSNF 
AMGRLPSTYRKGYMLDFGLJ^QYTNTTGDVR 

DLWSLFYMLVEFAVGQLPWRKIKDKEQVGMIKEKYEHRMLLKHMPSEFHLFLDHIASLDY 
FTKPDYQLIMSVFENSMKERGIAENEAFDWEKAGTDALLSTSTSTPPPAEHPADGSHVWG 
GQCDASAWGPAPGEHRGCATGRAPEXPGECTPNSAREALXGAGPQSPPCPPPRGSXGXSL 
GGDRCQPEQTPDQHRQSNCRQGEGRGWPFLSPPIPSLVPLPCSSXAPCPPPISLLARPLF 
PVPSPALASLCLPSSSSSVSFTLRRPSA 

SEQ ID NO: 167_AA234451_K 

MSGGGEQLDILSVGILVKERWKVLRKIGGGGFGEI YDALDMLTRENVALKVESAQQPKQV 
LKMEVAVLKKLQGKDKVCRFIGCGRNDRFNYVVMQLQGRNLADLRRSQSRGTFTI STTLR 
LGRQILESIESIHSVGSXHRDIKPSNFAMGRFPSTCRKCYMLDFGLARQFTNSCGDVRPP 
RAVAGFRGTVRYASINAHRNREMGRHDDLWSLFYMLVEFVVGQLPWRKIKDKEQVGSIKE 
RYDHRLMLKHLPPEFSI FLDHI SSLDYFTKPDYQLLTSVFDNSIKTFGVIESDPFDWEKT 
GNDGSLTTTTTSTTPQLHTRLTPAAIGIANATPI PGDLLRENTDEVFPDEQLSDGENGI P 
VGVSPDKLPGSLGHPRPQEKDVWEEMDANKNKIKLGICKAATEEENSHGOANGLLiNAPSL 
GSPI RVRS E I TQPDRDI PLVRKLRS I HS FELEKRLTLEPKPDTDKFLETWYKI VYFSF 

SEQ ID NO: 168_AA435956_H 
^rv^.r, <«,-. rv o*app«rp.j p pEMTVFDLEAKS ARGGSNLLMDS VS S FQLFMFQLLRGLAY I HHQHVLHRDLKPQN 

LLISHLGELKLiADFGLARAKS I PSQTYSSEWTLWYRPPDALLGATEYSSELDIWGAGCI 
FIEMFQGQPLFPGVSNILEQLEKIWEVLGVPTEDTWPGVSKLPNYNPEWFPLPTPRSLHV 
VWNRLGRVPEAEDLASQMLKGFPRDRVSAQEALVHDYFSALPSQLYQLPDEESLFTVSGV 
RLKPEMCDLLASYQKGHHPAQFSKCW 

SEQ ID NO: 169_AA626859_H 

NGVADGVIKSVLWQTLQALNFCHIHNGIHRDI KPENILITKQGI I KICDFGFAQILI PGD 
AYTDYVATRWYRAPELLVGDTQYGSSVDIWAIGCVFAELLTGQPLWPGKSDVDQLYLIIR 
TLGKLI PRHQS I FKSNGFFHGI S I PEPEDMETLEEKFSDVHPVALNFMKGCLKMNPDDRL 
TCSQLLESSYFDSFQEAQI KRKARNEGRNRRRQQNQLLPLI PGSHI SPTPDGRKQVLQLK 

FDHLPNI 

SEQ ID NO: 1 7 0_AA06 1 7 9 7_M 

KIALREIRMLKIKHPNLVNLIEVFRRKRKMHLVFEYGDHTLL^ELERNPNGVSDGVIKSV 
LWQTLQALNFGHKHNCI HRDVKPENI L I TKQGMI KI CDFGFARI LI PGDAYTDYVATRW Y 
RAPELLVGDTKYGSSVDVWAVGCVFAELLTGQPLWPGKSDVDQLYLI IRTLGKLI PRHQS 
IFRSNQFFRGISIPEPEDMETLEEKFSNVQPVALSFMKGCLKMNPDERLTCAQLLDSAYF 
ESFQEDQMKRKARSEGRSRRRQQNQLLPLI PGSHI SPTPDGRKQWQLKFDHLPNI 

SEQ ID NO: 171_AA397553_H 

MPNSERHGGKKDGSGGASGTLQPSSGGGSSNSRERHRLVSKHKRHKSKHSKDMGLVTPEA 
ASLGTVI KPLVEYDDI SSDSDTFSDDMAFKLDRRENDERRGSDRSDRLHKHRHHQHRRSR 
FIGURE 1M

DLLKAKQTEKEKSQEVSSKSGSMKDRISGSSKRSNEETDDYG? F vAQVAKSSSKESRSSKLH 

KEKTRKERELKSGHKDRSKSKRKRETPKSYKTVDSPKRRSRSPHRKV7SDSSKQDDSPSGA 

SYGQDYDLSPSRSHTSSNYDSYKKSPGSTSRRQSVSPPYKEPSAYQSSTRSPSPYSRRQR 

SVSPYSRRRSSSYERSGSYSGRSPSFYGRRRSSSPFLSKRSLSRSPLPSRKSMKSRSRSP 

AYSRHSSSHSKKKRSSSRSRHSSISPVRLPLNSSLGAELSRKKKEPAAAAAAAKMDGKES 

KG£PVFLPRKENSSVEAKDSGLESKKLPRSVI<^EKSAPDTELV^ r THLK T TEVKNSSDTGK 

VKLDENSEKHLVKDLKAQGTRDSKPIALKEEI VTPKETETSEKETPPPLPTI ASPPPPLP 

TTTPPPQTPPLPPLPPI PALiPQQPPLPPSQPAFSQVPASSTSTLPPSTHSKTSAVSSQAN 

SQPPVQVSVKTQVSVTAAIPHLKTSTLPPLPLPPLLPGGDDMDSPKETLPSKPVKKEKEQ 

RTRHLLTDLPLPPELPGGDLSPPDSPEPKAITPPQQPYKKRPKICCPRYGERRQTESDWG 

KRCVDKFDI I G 1 1 GEGTYGQVYKAJ^KDTGELVALKKVRLDNEKEGF P I TA I RE I KI LRQ 

LIHRSWNMKEIVTDKQDAiDFKKDKGAFYL^^ 

KQLMEGLEYCHKKtfFLHRDIKCSNILLNN 

YRPPELLL,GEERYTPAIDVWSCGCILGELFTKKPIFQANLELAQLELISRLiCGSPCPAVW 
PDVIKLPYFNTMKPKKQYRRRLREEFSFIPSAALDLLDHMLTLDPSKRCTAEQTLQSDFL 
KDVELSKMAPPDLPHWQDCHELWSKKRRRQRQSGVVVEEPPPSKTSRKETTSGTSTEPVK 
NSSPAPPQPAPGP^SGAGDAIGI^ITQQLNQSEI^VLI^LLQSQTDLSIPQMAQLLNI 
HSNPEMQQQLEALNQS I SALTEATSQQQDSETMAPEESLKEAPSAPVI LPSAEQMTLEAS 
STPADMQNIIJ^VLLSQLMKTQEPAGSLEENNSDKISrSGPQGPRRTPTMPQEEAAACPPHIL 
PPEKRPPEPPGPPPPPPPPPLVEGDLSSAPQELNPAVTAALLQLLSQPEAEPPGHLPHEH 
QALRPMEYSTRPRPNRTYGNTDGPETGFSAIDTDERNSGPALTESLVQTLVKNRTFSGSL 
SHLGESSSY(^TGSVQFPGDQDLRFARVPI^L.HPWGQPFLKAEGSSNSVVHAETK1 J QNY 
GELGPGTTGASSSGAGLHWGGPTQSSAYGKLYRGPTRVPPRGGRGRGVPY 

SEQ ID NO: 172_AA789239_H

MEMYETLGKVGEGSYGTVMKCKHKNTGQI VAI KI FYERPEQSVNKI AMREI KFLKQFHHE 
NLVNLIEVFRQKKKIHLVFEFIDHTVLDELQHYCHGLESKRLRKYLFQILRAIDYLHSNN 
VIIHRDIKPENILVSQSGITKLCDFGFARTLlAAPGDIYTDYVATRWYRAPELVLKDTSYG 
KYVPVDIWAIjGCMI IEMATGNPYLPSSSDLDLLHKIVLK^^ 
KPKESSKENELRKEERKTVYTNTL^ 

KKKEYEGGLGQQDANENVHPMSPDTKLVTIEPPNPINPSTNCNGLKENPHCGGSVTMPPI 
NLTNSNLMtfU^LSSNLFHPSVRLT^ 

FNERTGHSDQMANENKRKLNFSRSDRKEFHFPELPVTIQSKIDTKGMEVK^ 
TESSKIPTLLNVDQNQEKQEFIPLSLLSACCPIFTNICSQLTIRVEMAIARGRI 

SEQ ID NO: 173_AA124 976_M 
LAIDIVHACLQIDPAERTSSTDLLRHDYFTRDGFIEK^ 

FKENEPWDEKKSVFTNTLLYGNPSLYGK^VDRDKRAKELKVRVIKAKGGKGDVPDQKKP 
EYEGDHRQQGTADDTQPSSL.DKKPSVLELTNPLNPSENSDGVKEDPHAGGCMIMPPINLT 
SSNLLAANLSSNLSHPNSRLTERTKKRRTSSQTIGQTLSNSRQEDTGPTQVQTEKGAFNE 
RTGQNDQISSGNKRKLNFPKCDRKEFHFPELPFTVQAKEMKGMEVKQI KVLKRESKKTDS 
SKI PTLLSMDPNQEKQEGGDGDCEGKNLKRNRFFFSR 

SEQ ID NO: 174_AA575635_M CCRK_M 

SASGQLKIADFGLARVFSPDGGRLYTHQVATRWYRAPELLYGARQYDQGVDLWAVGCIMG 
ELLNGSPLFPGENDIEQLCCVLRILGTPSPRVWPEITELPDYNKISFEEQAPVPLEEVLP 
DASPQALDLLGQFLLYPPRQRIAASQALLHQYFFTAPLPAHPSELPI PQRPGGPAPKAHF 
GPPHVKDFHVDRPIEESLLNPELIRPFI PEG 
FIGURE 1N

SEQ ID NO: 175_AA631990_H

MITSISTEKSGHTKYFFMITTLQYYRGRGGKTAVWRHFSAEGPFAFAEMRHSKRTHCPDV; 
DSRESWGHESYRGSHKRKRRSHSSTQENRHCKPHHQFKESDCHYLEARSLNERDYRDRRY 
\T)EYRNDYCEGYVPRHYHRDIESGYRIHCSKSSVRSRRSSPKRKRNRHCSSHQSRSXEIV 
DTLGEGAFGKVVECIDHGMDGMHVAVKIVKNVGRYREAARSEIQVLEHLNSTDPNSVFRC 
VQMLEWFDHHGHVGIVFELLGLSTYDFIKENSFLPFQIDHIRQMAYOICQSINFLHHNKL 
THTDLKPENILFVKSDYWKYNSKMKi^ERTLKNTDIKVVDFGSATYDDEHHSTLVSTRH 
YRAPEVI LALGWSQPCDVWS I GC I LI EYYLGFTVFQTHDSKEHLAMMERI LGP I PQHKI Q 
KTRKRKYFHHNQLDWDEHSSAGRYVRRRCKPLKEFMLCHDEEHEKLFDLVRRMLEYDPTQ 
RITLDEALQHPFFDLLKKK 

SEQ ID NO: 176_AA557536_H

MCTVVDPRIVRRYLLRRQLGQGRTFREITLLQVSGLGPPVQSPCPGTDLSRQERNWPSWA 
PEHSPSWPSSRLRLSPQEFGDHPNI I SLLDVI RAENDRDI YLVFEFMDTDLNAVIRKGGL 
LQDVHVRS I FYQLLRATRFLHSGHWHRDQKPSNVLLDANCTVKLCDFGLARSLGDLPEG 
PEDQAVTEYVATRWYRAPEVLLSSHRYTASCPRYTLGVDMWSLGGILGEMLRGRPLFPGT 
STLHQLELILETI PPPSEEXRPRQTLDALLPPDTSPEALDLLRRLLVFAPDKRLSATQAL 
QHPYVQRFHCPSDEWAREADVRPRAHEGVQLSVPEYRSRVYQMILECGGSSGTSREKGPE 
GVS PSQAHLHKPRADPQLPSRTPVQGPRPRPQSSPGHDPAEHESPRAAKNVPRQNSAPLL 
QTALLGNGERPPGAKEAPPLTLSLVKPSGRGAAPSLTSQAAAQV/^NQALIRGDWNRGGGV 
RVASVQQVPPRLPPEARPGRRMFSTSALQGAQGGARALLGGYSQAYGTVCHSALGHLPLL 
EGHHV 

SEQ ID NO: 177_N28606_H, MOK_H
MKNYKAIGKIGEGTFSEVMKMQSLRJDGN^ 

PNILMLHEVVFDRKSGSLiALIGELMDMNI YELIRGRRYPLSEKXIMHYMYQLCKSLDHIH 
RNG I FHRDVKPENI L I KQDVLKLGDFGSGRS VYSKQP YTEYI STRWYRAPECLLTDGFYT 
^Y^DLWSAGCVFYEI ASLQPLFPGVNELDQI SKIHDVIGTPAQKILTKFKQSRAMNFDFP 
FKKGSGI PLLTTNLSPQGLSLLHAKVAYDPDERI AAHQALQHPYFQEQRKTEKRALGSHR 
KAGFPEHPVAPEPLSNSCQISKEGRKQKQSLKQEEDRPKRRGPAYVMELPKLKLSGVVRL 
SS YSSPTLQSVLGSGTNGRVPVLRPLKCI PASKKTDPQKDLKPAPQQCRLPTI VRKGGR 

SEQ ID NO: 178_AB023153_H, ICK_H 

MNRYTTI RQLGDGTYGSVLLGRS I ESGELI AI KKMKRKFYSWEECMNQREVKSLKKLNHA 
NVVKX.KEVIRENDHLYFIFEYMKI1NLYQLI 

FFHRDLKPENLLCMGPELVKI ADFGIxAJlEIRSKPPYTDYVSTRWYRAPEVLLRSTNYSSP 
I D VW A VGC I MAEVYTLRPLFPGAS EI DTI FKICQVLGTPKKTDWPEGYQLSSAMNFRWPQ 
CVPNNLKTLI PNASSEAVQLLRDMLQWDPKKRPTASQALRYPYFQVGHPLGSTTQNLQDS 
EKPQKGILERAGPPPYIKPVPPAQPPAKPHTRI SSRQHQASQPPLHLTYPYKAEVSRTDH 
PSHLQEDKPSPLLFPSLHNKHPQSKITAGLEHKNGEIKPKSRRRWGLISRSTKDSDDWAD 
LDDLDFSPSLSRIDLKNKICRQSDDTLCRFESVLDLKPSEPVGTGNSAPTQTSYQRRDTPT 
LRSAAKQHYLKHSRYLPGI S IRNGILSNPGKEFI PPNPWSSSGLSGKSSGTMSVI SKVNS 
VGSSSTSSSGLTGNYVPSFLKKEIGSAMQRVHLAPIPDPSPGYSSLKAMRPHPGRPFLDT 
QPRSTPGLI PRPPAAQPVHGRTDWASKYPSRR 

SEQ ID NO: 179_AA839940_K 

SSNNGGMSAEEEIGPGAEPMRGPSLATRDWRDETVGTTDLQQG I DPGAVS PEPGKDHAAQ 
GPGRTEAGRVSSAAEAAIWLDDSAAPPAPFEHRWSIKDTLISAGYTVSQHEVLGGGRF 
GQVHRCTERSTGLALAAKI I KVKNVKIjF.EOVKNEVNI MNQLSHVNL I QL YD A FES KNSFT 
L I MEYVDGGELFDR I TD EKYHLTELDVVLFTRQ I CEGVHYLHQHY I LHLDLKPEN I LCVS 
FIGURE 1O

QTGHQIKIIDFGLARRYKPREKLKVNFGTPEFLAPEWNYEFVSFPTDMWSVGVITYMLI.- 
SGLSPFLGETDAETMNFIVNCSWDFDADTFKGLSEEAKDFVSRLLVKEKSCRMS/aTQCLK 
KEWLNHLPAKASGSNWLRSQQLLQKYMAQSKWKKHFHWAAVNRLRKFPTCP 

SEQ ID NO: 180_AA460132_H 

MAAARATTPADGEEPAPEAEALAAARERSSRFLSGLELVKQG/vEARVFRGRFOGRAAVIK 
HRFPKGYRHPALEARLGRRRTVQEARALLRCRRAGI SAPWFFVDYASNCLYMEE I EGS V 
TVRDYIQSTMETEKTPQGLSNLiAKTIGQVLARMHDEDLIHGDLTTSNMLLKPPLEQLNI V 
LIDFGLSFISALPEDKGVDLYVLEKAFLSTHPNTETVFEAFLKSYSTSSKKARPVLKKLD 

EVRLRGRKRSMVG 

SEQ ID NO: 181_SGK034_H 

QREKWQGNMPGLQSTFLAMDTEEGVEVVWNELHFGDRKAFAAHEEKIQTVFEQLVLVTDH 
PNIVKLHKYWLDTSEACARVIFITEYVSSGSLKQFLKKTKKNHKAMNARAWKRWCTQILS 
ALSFLHACSPPI IHGNLTSDTI FIQHNGLIKIGSVWHRIFSNALPDDLRSPIRAEREELR 
NLHFFPPEYGEVADGTAVDIFSFGMCALEMAVLEIQTNGDTRVTEEAIARARHSLSDPNM 
REFILCCLARDPARRPSAHSLLFHRVLFEVHSLKLLAAHCFIQHQYLMPENWEEKTKAM 
DLHAVLAELPRPRRPPLQWRYSEVSFMELDKFLEDVRNGI YPLMNFAATRPLGLPRVLAP 
PPEEVQKAKTPTPEPFDSETRKVIQMQCNLERSEDKARWHLTLLLVLEDRLHRQLTYDLL 
PTDSAQDLASELVHYGFLHEDDRMKLAAFLESTFLKYRGTQA 

SEQ ID NO: 182_AA103218_M SGK034_M 

HASAPEYGEVNDGTGFVD I FS FGMCALEMAVLE I QANGDTRVTEEAI ARARHS LSDPNMR 
EFILSCLARDPARRPSAHNLLFHRVLFEVHSLKLLAAHCFIQHQYLMPENVVEEKTKAMD 
LHAVLAEMPQPHGPPMQWRYSEVSFLELDKFLEDVRNGIYPLMNFAAARPLGLPRVLAPP 
PEEAQKAKTPTPEPFDSETRKWQMQCNLERSEDKARWHLTLLLVLEDRLHRQLTYDLLP 
TDSAQDLAAELVHYGFLHEDDRTKLAAFLETTFLKYRGTQA 

SEQ ID NO: 183_NEK7_H, N34132_H

MSGGAAEKQSSTPGSLFLSPPAPAPKNGSSSDSSVGEKLGAAAADAVTGRTEEYRRRRHT 

MDKDSRGAAATTTTTEHRFFRRSVI CDSNATALELPGLPLSLPQPS I PAAVPQSAPPEPH 
REETVTATATSQVAQQPPAAAAPGEQAVAGPAPSTVPSSTSKDRPVSQPSLVGSKEEPPP 

ARSGSGGGS AKE PQEERSQQQDD I EELETKAVGMSNDGRFLKFDI E IGRGS FKTVYKGLD 
TETTVEVAWCELQDRKLTKSERQRFKEEAEMLKGLQHPNIVRFYDSWESTVKGKKCIVLV 
TELMTSGTLKTYLKRFKVMKIKVLRSWCRQILKGLQFLHTRTPLI IHRDLKCDNI FITGP 
TGSVKIGDLGLATLKRASFAKSVIGTPEFMAPEMYEEKYDESVDVYAFGMCMLEMATSEY 
PYSECQNAAQI YRRVTSGVKPAS FDKVAI PEVKE 1 1 EGC I RQNKDERYS I KDLLNHAFFQ 

EETGVRVELAEEDDGEKIAIKLWLRI EDI KKLKGKYKDNEAIEFCFDLERDVPEDVAQEM 
VESGYVCEGDHKTMAKAIKDRVSLIKRKREQRQLVREEQENKKQEESSLKQQVEQSSASQ 
TGIKQLPSASTGI PTASTTSASVSTQVEPEEPEADQHQQLQYQQPSISVLSDGTVDSGQG 

SSVFTESRVSSQQTVSYGFPXHEQAHSTGTVPGHIPSTVQAQSQPHGVYPPSSVQQGIQO 
TAPPQQTVQYSLSQTSTSSEATTAQPVSQPQAPQVLPQVSAGKQSTQGVSQVAPAEPVAY 

AQPQATQPTTLASSVDSAHSDVASGMSDGNENVPSSSGRHEGRTTKRHYRKSVRSRSRHE 

KTSRPKLRI LNVSNKGDRWECQLETHNRKMVTFKFDLDGDNPEE I AT I MVNNDF I LAI E 
RESFVDQVREI IEKADEMLSEDVSVEPEGDQGLESLQGKDDYGFSGSQKLEGEFKQPI PA 
SSMPQQIGIPTSSLTQWHSAGRRFIVSPVPESRLRESKVFPSEITDTVAASTAQSPGMN 
LSHSASSLSLQQAFSELRRAQMTEGPNTAPPNFSHTGPTFPWPPFLSSIAGVPTTAAAT 
APVPATSSPPNDISTSVIQSEVTVPTEEGIAGVATSTGWTSGGLPIPPVSESPVLSSVY 
SSITIPAWSISTTSPSLQVPTSTSEIWSSTALYPSVTVSATSASAGGSTATPGPKPPA 
WSQQAAGSTTVGATLTSVSTTTSFPSTASQLSIQLSSSTSTPTLAETWVSAHSLDKTS 

FIGURE 1P

HSTTTGLiAFSLSAFSSSSSPGAGVSSYI SC'PGGLKPLVI FSVI ASTPILPQAAGFTSTFL 
LPC^PSIPPLVQPVANYFAVQQTLIHSCPvPAL^ K 
TLEEKLRSLFSEKSSSGAOHASVSLETSLVIESTVTPGI PTTAVAPSKLLTSTTSTCLFP 
TNLFLGTVA1jPVTPVVTPGQVSTPV£TTTSGVKPGT7\PSKFPLTKAFVLFVGTELPAGTL 
PSEQLPPFPGPSLTQSQCFLEDLZ'AQLRRTLSPEKITVTSAVGPVSI^IAAPTAITEAGTQP 
QKGVSQVKEGPVLATSSGAGVFKMGRFQVSVAADGAQKEGKNKSEDAKSVHFESSTSESS 
VLSSSSPESTLVKPEPNG ITI PGI SSDVPESAHKTTASEAKSDTGQPTKVGRFQVTTTAN 
KVGRFSVSKTEDKITDTKKEGPVAS PPFMDLEQAVLPAVI PKKEKPELSEPSHLNGPSSD 
PEAAFLSRDVDDGSGSPHS PHQLSSKSLPSQNLSQSL.SNSFNSS YMSSDNESDI EDEDLK 
LELRRLRDKHLKEIQDLQSRQKHEIESLYTKLGKVPPAVIIPPAAPLSGRRRRPTKSKGS 
KSSRSSSLGNKSPQLSGNLSGQSAASVLHPQQTLHPPGNIPESGQNQLLQPLKPSPSSDN 
LYSAFTSDGAI SVPSLSAPGQGNKATI IVQKQ 

SEQ ID NO: 184_BCON3_H 

MSEGESQTVLSSGSDPKVESSSSAPGLTSVSPPVTSTTSAASPEEEEESEDESEILEESP 

CGRWQKRREEWQRNVPGIDSAYLAMDTEEGVEVVWNEVQFSERP^^ 

LIQLEHLNI VKFHKYWADI KENKARVI FITEYMSSGSLKQF 

CTQILSALSYLHSCDPPI IHGNLTCDTIFIQHNGLIKIGSVAPDTINNHVKTCREEQKNL 
HFFAPEYGEVTNVTTAVDI YSFGMCALEMAVLEI QGNGESS YVPQEAI SSAIQLLEDPLQ 
REFIQKCLQSEPARRPTARELLFHPALFEVPSLKLLlAAH^ 

DTSAVLAEIPAGPGREPVQTLYSQSPALELDKFLEDVRMGIYPLTAFGLPRPQQPQQEEV 
TSPWPPSVKTPTPEPAEVETRKWLMQCNIESVEEGVKHHLTLLLKLEDKLNRHLSCDL 
MPNENIPELAAELVQLGFISEADQSRLTSLLEETLNKFNFARNSTLNSAAVTVSS 

SEQ ID NO: 185_AA711829_M

LKQFLKKTKKNHKTMNEKAWKRWCTQILSALSYLKSCDPPI IHGNLTCDTIFIQHNGLIK 
IGSVAPDTIN1<IHVKTCREEQK^ T LHFFAPEYGEVTWTTAVDIYSFGMCALEMAVLEIQGN 
^nnr-.n <m f v,^ ES s YVPQEA I S S A I QLLEDS LQRE F I QKCLQ S E PARR PTARE LLFH P ALFE VP S LKLLA 

AHC I VGHQHMI PEN ALE EI TKNMDTSAVLAE I PAGPGREP VQTLYSQS PALELDKFLEDV 
RNGIYPLTAFGLPRPQQPQQEEVTSPWPPSVKTPTPEPAEVETRKVVLMQCNIESVEEG 
VPCHHLTLLLKLEDKIjNRKLSCDLMPNESI PDLAA£LVQLGFI SEADQSRLTSLLEETLNK 
FNFTRNSTLNTATVTVS S 

SEQ ID NO: 186_AA099102_H 

MSSCVSSQPSSNRAAPQDELGGRGSSSSESQKPCEALRGLSSLSIHLGMESFIVVTECEP 
GCAVDLGLJVRDRPLEADGQEVPLDTSGSQARPHLSGRKLSLQE 

CICPSLPYSPVSSPQSSPRLPRRPTVESHPTVSITGMQDCVQLNQYTLKDEIGKGSYGVVK 
LJ^YNENDNTYYAMKVLSPCKKLIRQAAFPRRPPPRGTRPAPGGCIQPRGPIEQVYQEIAIL 
KKXDHPNWKiVEVLDDPNEDHLY^FELWQGPVMEVPTLKPLSEDQARFYFQDLIKGI 
EYLHYQKIIHRDIKPSNLLVGEDGHIKIADFGVSNEFKGSDALLSNYVGTPAFMAPESLS 
ETRKI FSGKAKDVWANiGVTLYCFVFGQCPFMDERIMCLHSKI KSQALEFPDQPDI AEDLK 
DLITR>1LDKNPESRIWPEIKLHPWVTRHGAEPLPSEDENCTLVEVTEEEVENSVKHIPS 
LATVILVKTMIRKRSFGNPFEGSRREERSLSAPGNLLTKKPTRECESLSELKEARQRRQP 
PGHRPAPRGGGGSALVRGSPCVESGWAPAPGSPARMHPLRPEEAMEPE 

SEQ ID NO: 187_5R69_17_2_H 

MQEI PQEQI KEI KKEQLSGS PWI LLRENEVSTLYKGEYHRAPVAI KVFKKLQAGS I AI VP 
QTFNKEI KTMKKFESPNI LRI FGI CI DETVTPPQFS I VMEYCELGTLRELLDREKDLTLG 
FIGURE 1Q

KRMVLVLGAARGLYRLHHSEAPELHGKIRSSNFLVTQGYQVKIAGFELRKTQTSMSLG r rT 
REKTDRVKSTAYLSPQELEDVFYQYDVKSEIYSFGIVLWEIATGDIPFQGEECEDWLSOW 

L 

SEQ ID NO: 188_H85811_H 

MAPWEGI^SHVQVFSPHTLQSSAFCSVKKLKIEPSS^DMTGYGSHSKVYSQSKNIPLS 
QPATTTVSTSLPVPNPSLPYEQTIVFPGSTGHIWTSASSTSVTGQVLGGPHNLMRRSTV 
SLLDTYQKCGLKRKSEE I ENTS S VQ 1 1 EEHP PMI QNNASGATVATATTSTATS KNSGS NS 
EGDYQLVOHEVLCSMTNTYEVLEFLGRGTFGQWKCWKRGTNEIVAIKILKNKPSYARQG 
QIEVSILARLSTESADDYNFVRAYECFQHKNHTCLVFEMLEQNLYDFLKQNKFSPLPLKY 
IRPVLQQVATALMKLKSLGLIHADLKPENIMLVDPSRQPYRVKVIDFGSASHVSKAVCST 
YLQSRYYRAPEI ILrGLPFCEAIDMWSLGCVIAELFLGWPLYPGDSEYDQIRYI SQTQGLP 
AEYLLSAGTKTTRFFNRDTDSPYPLWRLKTPDDHEAETGIKSKEARKYIFNCLDDMAQVN 
MTTDLEGSDMLVEKADRREFIDLLKKMLTIDADKRITPIETLNHPFVTMTHLLDFPHSTH 

VKSCFQNMEICKRRVNMYDTVNQSCT 

AAVAQRSMPLQTGTAQICARPDPFQQALIVCPPGFQGLQASPSKHAGYSVRMENAVPIVT 
QAPGAQPLQIQPGLLlAQQAWPSGTQQILLPPAWQQLTGVATHTSVQHATVI petmagtqq 
I^WRNTHAHGSHYNPIMQQPALLTGHVT^ 

QSSVRNVSTCEVSSSQAISSPQRSKRVKENTPPRCAMVHSSPACSTSVTCGWGDVASSTT 
RERQRQTIVIPDTPSPTVSVITISSDTDEEEEQKHAPTSTVSKQRKNVISCVTVHDSPYS 

DSSSNTSPYSVQQRAGHNNANAFDTKGSLENHCTGNPRTI I VPPLKTQASEVLVECDSLV 
PVNTSHHSSSYKSKSSSNVTSTSGHSSGSSSGAITYRQQRPGPHFQQQQPLNLSQAQQHI 
TTDRTGSHRRQQAYITPTMAQAPYSFPHNSPSHGTVHPHLAAAAAAAHLPTQPHLYTYTA 
PAALGSTGTVAHLVASQGSARHTVQHTAYPASIVHQVPVSMGPRVLPSPTIHPSQYPAQF 

AHQT Y I S AS PAS TVYTG Y PLS PAKVNQ Y P Y I 
SEQ ID NO: 189_DYRK3_H

MMIDETKCPPCSNVLCNPSEPPPPRRLNMTAEQFTGDHTQHFLDGGEMKVEQLFQEFGNR 
KSNTIQSDGISDSEKCSPTVSQGKSSDCLNTVKSNSSSKAPKWPLTPEQALKQYKHHLT 
AYEKLE I INYPEI YFVG PNAKKRHGVI GGPNNGGYDDADGAY I HVPRDHLlAYR YEVLKI I 
GKGSFGQVARVYDHKLRQYVALKKVRNEKRFHRQAAEE I R I LEHLKKQDKTGSKNV I HML 
ESFTFRNHVCl^FELLSIDLYELIKKNKFQGFSVQLVRKFAQSILQSLDALHKNKIIHCD 

LKPENILLKHHGRSSTKVIDFGSSCFEYQKLYTYIQSRFYRAPEI ILGSRYSTPIDI WSF 
RCILAELLTGQPLFPGEDEGDQIACMMELLGMPPPKLLEQSKRAKYFlNSKGIPRYCSVT 
TQADGRWLVGGRSRRGKKRGPPGSKDWGTALKGCDDYLFIEFLKRCLHWDPSARLTPAQ 
ALRHPWISKSVPRPLTTIDKVSGKRVWPASAFQGLGSKLPPWGIANKLKANLMSE 

SIPLGSVLPKLIS 

SEQ ID NO: 190_AA589241_M DYRK3_M

TRPELLGMPPQKLLEQSKRAPCYFINSKGLPRYCSVSTQTDGRWLLGGRSRRGKKRGPPG 
SKDWATALKGCGDYLFIEFLKRCLQWDPSARLTPAQALRHPWISKSTPKPLTMDKVPGKR 
VVNPTNAFQGLGSKJjPPVVGIASKLKANLMSETSGS I PLCSVLPKLI S 

SEQ ID NO: 191_5R72_16_2_H

MAGGRGAPGRGRDEPPESYPQRQDHELQALEAIYGADFQDLRPDACGPVKEPPEINLVLY 
PQGLTGEEVYVKVDLRVKCPPTYPDWPEIELKNAKGLSNESWLLKSRLEELAKKHCGE 
VMIFELAYHVQSFLSEHNKPPPKSFHEEMLERRAQEEQQRLLEAKRKEEQEQREILHEIQ 
RRKEEIKEEKKRKEIVIAKQERLEIASLSNQDHTSKKDPGGHRTAAILHGGSPDFVGNGKHR 
ANSSGRSRRERQYSVCNSEDSPGSCEILYFNMGSPDQLMVHKGKCIGSDEQLGKLVYNAL 
ETATGGFVLLYEWVLQWQKKMGPFLTSQEKEKIDKCKKQIQGTETEFNSLVKLSHPNVVR 
FIGURE 1R

YLAMNLKEQDDS I WDI LVEHI SGVSLAAHLSHSGP I PVHQLRRY'TAQLLSGLDYLHSK S 
WHKVLSAS NVLVDAEGTVKI TDYS I SKRLAD I CKEDVFEC'TRVRFSDNALPYKTGKKGD 
VWRLGLLLLSLSQGQECGEYPVTIPSDLPADFQDFLKKCVCLDDKERWSPQQLLKHSFIN 
PQPKMPLVEQSPEDSGGQDYVETVI PSNRLPSAAFFSETQRQFSRYFI EFEELQLLGKGA 
FGAVI KVQNKLDGCCYAVKRI PINPASRQFRRI KGEVTLLSRLKHENI VRYYNAW I ERHE 
RPAGPGTPPPDSGPLAKDDRAARGQPASDTDGLDSVEAAAPPPILSSSVEWSTSGERSAS 
ARFPATGPGSSDDEDDDEDEHGGVFSQSFLPASDSESDI I FDNEDENSKSQNQDEDCNE K 
NGCHESEPSVTTEAVTiYLYIQMEYCEKSTLRDTIDQGLYRDTVRLWRLFREILDGLAYIH 
EKGMIHRI^LKPWIFLDSDDHVKIGDFGLATDHLAFSADSKQDDQTGDLIKSDPSGHLTG 
MVGTALYVS PEVQGSTKSAYNQKVDLFSLG 1 1 FFEMS YHPMVTASERI FVLNQLRDPTS P 
KFPEDFDDGEHAKQKSVISWLLNHDPAKRPTATELLKSELLPPPQMEESELHEVLHHTLT 
NVDGKAYRTMMAQ IFSQRIS PAI DYTYDSD I LKGNFS I RTAKMQQHVCET 1 1 RI FKRHGA 
VQLCTPLLLPRNRQIYEHNEAALFMDHSGMLVML^ 

RVFRPRKLDRFHPKELLECAFDI VTSTTNSFLPTAEI I YTI YEI IQEFPALQERNYS I YL 
NHTMLLKAILLHCGIPEDKLSQVYIILYDAVTEKLTRREVEAKFCNLSLSSNSLCRLYKF 
IEQKGDLQDLMPTINSLIKQKTGIAQLVKYGLKDLEEVVGLLKKLGIKLQVLINLGLVY?: 
VQQHNG 1 1 FQFVAF I KRRQRAVPE I LAAGGRYDLL I PQFRGPQALG P VPTA I GVS I A I D K 
ISAAVLNMEESVTISSCDLLWSVGQMSMSRAINLTQKLWTAGITAEIMYDWSQSQEELQ 
EYCRHHEITYVALVSDK£GSHVKVKSFEKERQTEKRVLE 

RELASDNLAVQNLKGSFSNASGLFEIHGATVVPIVS\n^PEKLiSASTRRRYETQVQTRLQT 
SIANLHQKSSEIEIIJ^VDLPKETILQFLSLEWDADEQAFNTTVKQLLSRLPKORYLKLVC 

DEI YNIKVEKKVSVLFLYSYRDDYYRILF 
SEQ ID NO: 192_R43524_K, HRI_H

MLiGGNSGVRKREEEGDGAGAVAAPPAIDFPAEGPDPEYDESDVPAEIQVIjKEPLQQPTFP 
FAVANQLLLVSLLEHLSHVHEPNPLRSRQVFKLLCQTFIKMGLLSSFTCSDEFSSLRLHH 
NRAITHLMRSAKERVRQDPCEDISRIQKIRSREVALEAQTSRYLNEFEELVILGKGGYGR 
VYKVRNKLDGQYYAI KXILIKGATKTVCMKVLREVKV'LAGLQHPNI VGYHTAWIEHVH^ 
QPRADRAAIELPSLEVLSDQEEDREQCGVKNDESSSSSI IFAEPTPEKEKRFGESDTENQ 
NNKSVKYTTNLVIRESGELESTLELQENGLAGLSASSIVEQQLPL.RRNSHLEESFTSTEE 
SSEENVNFLGQTEAQYHLMLHIQMQLCELSLWDWIVEI^^ 
KIFQELVEGVFYIHlSn^GIVHRDLKPRNIFLHGPDQQVKIGDFGIjAC 

GKRTPTHTSRVGTCLYASPEQLEGSEYDAKSDMYSLGWLLELFQPFGTEMERAEVLTGL 
RTGQLPESLRKRCPVOAKYIQHLTRRNSSQRPSAIQLLQSELFQNSGJSTVNLTLQMKIIEQ 
EKEI AEL KKQ LNL LSQDKGVRDDG KDGGVG 

SEQ ID NO: 193_17000057519457_H 

MAAARATTPADGEEPAPEAEAIxAAARERSSRFLSGLELVKQGAEARVFRGRFQGRAAVIK 
HRFPKGYRHPALEARLGRRRTVQEARALLRCRRAGI SAPWFFVDYASNCLYMEEIEGSV 
TVRDYIQSTMETEKTPQGLSNLAKTIGQVIARMHDEDLIHGDLTTSNMLLKPPLEQLNIV 
LIDFGLSFISALPEDKGVDLYVLEKAFLSTHPNTETVFEAFLKSYSTSSKKARPVLKKLD 
EVRLRGRKRSMVG 

SEQ ID NO: 194_AA013524_M 

LVQQGAEARVFRGRFQGRAAWKHRFPKSYRHPELEARLGRRRTVQEARALLRCRRAGIA 
APWFFVDYASNCLYMEEIEDSVTVRIDYIQSTMETEKDPQGL^^ 

LIHGDLTTSNNLLRRPLAQLHI VLIDFGLSFVSGLPEDKGVDLYVLEKAFLSTHPHTETA 
FEAFLKSYGASSKKSSPVLKKLDEVRLRGRKRSMVG 
FIGURE 1S



SEQ ID NO: 195_17000139801197_H, IRAKM_H

MAGNCGARGALSAHTLLFDLPPALLGELCAVLDSCDGALGWRGLAERLSSSWLDVRHIEK 
YVDQGKSGTRELLWSWAQKNKTIGDLLQVLQEMGHRRAIHLITNYGAVLSPSEKSYOEGG 
FPNILFKETANVTVDNVLIPEHNEKGVLLKSSISFQNI IEGTRNFHKDFLI GEGET FEVY 
RVEIQNLTYAVKLFKQEKKMQCKJOiWKRFLSELEVLLLFHHPNILELAAYFTETEKFCLi 
YPYMRNGTLFDRLQCVGDTAPLPWHIRIGILIGISKAIHYLHN\^QPCSVICGSISSANIL 
LDDQFQPKLTDFAMAHFRSHLEHQSCTINMTSSSSKHLWYMPEEYIRQGKLSI KTDVYSF 
GIVIMEVLTGCRWLDDPKHIQLRDLLRELMEKRGLDSCL.SFLDKIWPPCPRNFSAKLFC 
LAGRCAATRAKLRPSMDEVLNTLESTQASLYFAEDPPTSLKSFRCPSPLFLENVPSI PVE 
DDESQNNNLLPSDEGLRIDRMTQKTPFECSQSETVMFLSLDKKPESKRNEEACNMPSSSCE 
ESWFPKYI VPSQDLRPYKVNIDPSSEAPGHSCRSRPVESSCSSKFSWDEYEQYKKE 

SEQ ID NO: 196_AA840598_M IRAKM_M

MWKRFLSELEVLLLFRHPHILEl^AAYFTETEKLCLVYPYMSNGTLFDRLQCTNGTTPLSV? 
HWISVLIGIAKAIQYLHOTQPCAVICGNVSSANILLDDQLQPKLTDFAAAHFRPNLEQQ 
SSTINMTGGGRKHLWYMPEEYIRQGRLSVKTDW 

DLLMELMEKRGLDSCLSFLDRKIPPCPRNFSAXLFSLAGRCVATKAKljRPTMDEVLSSLE 
STQPSLYFAEDPPTSLKSFRCPSPLFLDNVPS I PVEDDENQNNHSVPPKEVLGTDRVTQK 
TPFECSQSEVTFLGLDRNRGNRGSEADCWPSSSHEECWSPELVAPSQDLSPTVISLGSS 
WEVPGHSYGSKPMEKRCSSGLFCSEHEQSKKQ 

SEQ ID NO: 197_AA088547_H 

MASAVRGSRPWPRLGLQLQFAALLLGTLSPQVHTLRPENLLLVSTLDGSLHALSKQTGDL 

KWTLRDDPVIEGPMYVTE^4AFLSDPADGSLYILGTQKQQGLMKLPFTIPELVHASPCRS£ 

DGVFYTGRKQDAWFVVDPESGETQMTLTTEGPSTPRLYIGRTQYTVTMHDPRAPALRWNT 

TYRRYSAPPMDGSPGKYMSHIJ\SCGMGLLLTVDPGSGTVLWTQDLGVPVMGVYTW 

RQLPHLTIoARDTLHFLALRWGHIRLPASGPRDTATLFSTLDTQLLMTLYVGKDETGFYVS 

KALVHTGVALVPRGLTLAPADGPTTDEVTLQVSGEREGSPSTAVRYPSGSVALPSQWLLI 

GHHELPPVLHTTMLRVHPTLGSGTAETRPPENTQAPAFFLELLSLSREKLWDSELHPEEK 

TPDSYLGLGPQDLIJ^SLTAVLLGGWILFVMRQWEKQQETPLAPADFAHISQDAQSLHS 

GASRRSQKRLQSPSKQAQPLDDPEAEQLTWGKISFNPKDVLGRGAGGTFVFRGQFEGRA 

VAVKRLLRECFGLVRREVQLLQESDRHPNVLRYFCTERGPQFHYIALELCRASLQEYVEN 

PDLDRGGLEPEVVLQQIJ^SGI^LHSLHIVHRDLKPGNILITGPDSQGLGRVVLSDFGLC 

KKLPAGRCSFSLHSGIPGTEGWMAPELLQLLPPDSPTSAVDIFSAGCVFYYVLSGGSHPF 

GDSLYRQANILTGAPCLAHLEEEVHDKWAR 

AKQLQFFQDVSDWLEKESEQEPLVRALEAGGCAWRDNWHEHISMPLQTDLRKFRSYKGT 
SWDLLRAVRNKKHHYRELPV^WQAIjGQVPDGFVQYFTNRFPRLLLHTHRAMRSCASES 
LFLPYYPPDSEARRPCPGATGR 

SEQ ID NO: 198_HGP_6644466

MEGISNFKTPSKLSEKKKSVLCSTPTINI PASPFMQKiGFGTGVNVYLMKRSPRGLSHSP 
WAVKXINPICNDHYRSVYQKRLiMDELAKILKSLHHPNI VGYRAFTELA2\T 
KSLNDLIEERYKASQDPFPAAI ILKVALNMARGLKYL^ 

TI KICDVGVSLPLDENMTVTDPEACYIGTEPWKPKEAVEENGVI TDKADI FAFGLTLWEM 
MTLSI PHINLSNDDDDEDKTFDESDFDDEAYYAALGTRPPINMSELDESYQKVIELFSVC 
TNEDPKDRPSAAHI VELALETDV 

SEQ ID NO: 199_AA449542_K 

SPRGLSHSPWAVKKI SLLCDDKYRTVYQKRLTDEAKI LKNLNHPNI IGYRAFTEASDGSL 
CLAMEYGGEKSLNDLIEERNKDSGSPFPAAVILRVALHMARGLKYLHQEKKLLHGD 
FIGURE 1T



NVYI KGDFET I KT CDVGVSLPLDENMTVTDFEACY I GTEPWKPKEALEENG I I TDKADVF 
AFGLTLWEMMTLCIPKWLPDDDVDEDATFDESDFDDEAYYAALGTRPS INMELDDS YQ>: 
AIELFCVCTNEDPKDRPSAAHIVEALELDGQCCGLSESKH 

SEQ ID NO: 200_5R57_10_2_M TESK2_M
LLDSDLYLPWT\^VKLAYGIAVGLSYLHFKGIFHRDLTSKV 

SEQ ID NO: 201_AA232253_H 

MSSLGASFVQI KFDDLQFFENCGGGSFGSVYRAKWISQDKEVAVKKLLKI EKEAEI LSVL 
SHRNIIQFYGVILEPPNYGIVTEYASLGSLYDYINSNRSEEMDMDHIMTWATDVAKGMHY 
LHMEAPVKVIHRDLKSRNVVI AADGVLKI CDFGASRFHNHTTHMSLVGTFPWMAPEVIQS 
LPVSETCDTYSYGWLWEMLTREVPFKGLEGLQVAWLWEKNERLTIPSSCPRSFAELLH 
QCWEADAKKRPSFKQI I S ILESMSNDTSLPDKCNSFLHNKAEWRGEI EATLERLKKLERD 
LS FKEQELKERERRLKMWEQKLTEQSNTPLLPS FE I GAWTEDDVYCVA/QQLVRKGDS S AE 
MSVYASLFKENNITGKRLLLLEEEDLKDMGI VSKGHI IHFKSAI EKLTHDYINLFHFPPL 
IKDSGGEPEENEEKIVNLELVFGFHLKPGTGPQDCKWKMYMEMDGDEIAITYIKDVTFNT 
NLPDAEILKMTKPPFVMEKWIVGIAKSQTVECTVTYESDVRTPKSTKHVHLIQWSRTKPQ 
DEVKAVQLAIQTLFTNSDGNPGSRSDSSADGQWLDTLRMRQIASNTSLQRSQSNPILGSP 
FFSHFDGQDSYAAAVRRPQVPIKYQQITPVNQSRSSSPTQYGLTKNFSSLHLNSRDSGFS 
SGNTDTSSERGRYSDRSRNKYGRGS I SLiNSSPRGRYSGKSQHSTPSRGRYPGKFYRVSQS 
ALNPHQSPDFKRSPRDLHQPNTIPGMPLHPETDSRASEEDSICVSEGGWTKVEYRKKPHRP 
S PAKTNKERARGDHRGWRNF 

SEQ ID NO: 202_AI375137_H

MGNYKSRPTQTCTDEWKKKVSESYVITIERLEDDLQIKEKELTELRNI FGSDEAFSKVNL 
NYRTENGLSLLHLCCICGGKKSHIRTLMLKGLRPSRLTRNGFTAIjHLAVYKDNAELITSL 
LHSGADIQQVGYGGLTALHIATIAGHLE1AAI3VLLQHGANVNIQDAVFFTPLHIAAYYGHE 
^OVTRLLLKFGADVNVSGEVGDRPLHIASAKGFLNIAKLLMEEGSKADVNAQDNEDHVPIjH 
FCSRFGHHDI VKYLLQSDLEVQPHVVNI YGDTPLHLACYNGKFEVAKEI IQI SGTESLTK 
ENIFSETAFHSACTYGKS IDLVKFLLDQNVININHQGRDGHTGLHSACYHGHIRLVQFLL 
DNGADMNLVACDPSRSSGEKDEQTCLMWAYEKGHDAIVTLLKHYKRPQDELPCNEYSQPG 
GDGSYVSVPS PLGKI KSMTKEKADI LLLRAGLPSHFHLQLSEI EFHEI I GSGSFGKVYKG 
RCRNKI VAIKRYRANTYCSKSDVDMFCREVSILCQLNHPCVIQFVGAGLNDPSQFAI VTQ 
YISGGSLFSLLHEQKRILDLQSKLIIAVDVAKGMEYLHNLTQPIIHRDDNSHNILLYEDG 
HAWADFGESRFLQSLDEDNMTKQPGNLRWMAPEVFTQGTRYTI KADVFS YALCLWE I LT 
GEI PFAHLKPAAAAADMAYHHIRPPIGYS I PKPI SSLLI RGWNACPEGRPEFSEWMKLE 
ECLCNI ELMSPASSNSSGSLS PSSSSDCLVNRGGPGRSHVAALRSRFELEYALNARSYAA 
LSQSAGQYSSQGLSLEEMKRSLQYTPIDKYGYVSDPMSSMHFHSCRNSSSFEDSS 

SEQ ID NO: 203_H97685_H 

MESERSPLYRQLIDLGYLSSSHWNCGAPGQDTKAQSMLVEQSEKLRHLSTFSHQVLQTRL 
VDAAKALNL VHCHCLD I F I NQAFDMQRDLQ I TPKRLE YTRKKENEL YE S LMN IANRKQEE 
MKDMIVETLNTMKEELLDDATNMEFKDVIVPENGEPVGTREIKCCIRQIQELI ISRLNQA 
VANKLISSVDYLRESFVGTLERCLQSLEKSQDVSVHITSNYLKQILNAAYHVEVTFHSGS 
SVTRMLWEQI KQI IQRITWVSPPAITLEWKRKVAQEAIESLSASKLAKS I CSQFRTRLNS 
SHEAFAASLRQLEAGHSGRLEKTEDLWLRVRKDHAPRLARLSLESRSLQDVLLHRKPKLG 
QELGRGQYGVVTLCDNVJGGHFPCALKSWPPDEKHWNDIALEFHYMRSLPKHERLVDLHG 
SVIDYNYGGGSS IAVLLIMERLKRDLYTGLKAGLTLETRLQI ALDWEGI RFLHSQGLVH 
RDI FCLh'ZNVLLDKQNRAKI TDLGFCKPEANMSGS I VGTP I HMAPELFTGKYDNS VDVTAFG 
FIGURE 1U



ILFWYICSGSVKLPEAFERCASKDHLWNNVRRGARPERLPVFDEECWQLMEACWDGDPLK 

RPLLGIVQPMLQGIMNRLCKSNSEQPNRGLDDST 

SEQ ID NO: 204_W20810_M

DVNLKASKASDVYSFGILVWAVLAGREAELVDKTSLIRETVCDRQSRPPLTELPPGSPET 
PGLEKLKELMIHCWGSQSENRPSFODCEPKTNEVYNLVKDKVDAAVSEVKKYLSQHRSSG 
RNLSAREPSQRGTEMDCPRETMVSKMLDRLHLEEPSGPVPGKCPERQAQDTSVGPATPAR 
TSSDPVAGTPQIPHTLPFRGTTPGPVFTETPGPHPQRNQGDGRHGTPWYPWTPPNPMTGP 

PALVFNNCSEVQIGNYNSLVAPPRTTASSSAKYDQAQFGRGRGWQPFHK 
SEQ ID NO: 205_AA744236_H

MGSENSALKSYTLREPPFTLPSGLAVYPAVLQDGKFASVFVYKRENEDKVNKAAKHLKTL 
RHPCLLRFLSCTVEADGIHLVTERVQPLEVALETLSSAEVCAGIYDILLALIFLHDRGHL 
THNNVCLSSVFVSEDGHWKLGGMETVCKVSQATPEFLRSIQSIRDPASIPPEEMSPEFTT 

LPECHGHARDAFSFGTLVESLLTILNEQVSADVLSSFQQTLHSTLLNPI PKCRPALCTLL 
SHDFFRNDFLEWNFLKSLTLKSEEEKTEFFKFLLDRVSCLSEELIASRLVPLLLNQLVF 

AEPVAVKSFLPYLLGPKKDHAQGETPCLLSPALFQSRVIPVLLQLFEVHEEHVRMVLLSH 
IEAYVEHFTQEQLKKVILPQVLLGLRDTSDSIVAITLHSLAVLVSLLGPEVWGGERTKI 
FKRTAPSFTKNTDLSLEGDPFSQPIKFPINGLSDVKNTSEDSENFPSSSKKSEEWPDWSE 
PEEPENQTVNIQIWPREPCDDVKSQCTTLDVEESSWDDCEPSSLDTKVNPGGGITATKPV 
TSGEQKPIPALLSLTEESMPWKSSLPQKISLVQRGDDADQIEPPKVSSQERPLKVPSELG 
LGEEFTIQVKKKPVKDPEMDWFADMI PEIKPSAAFLILPELRTEMVPKKDDVSPVMQFSS 
KFAAAE I TEGEAEGWEEEGELNWEDNNW 

SEQ ID NO: 206_AI052250_H

MESMLNKLKSTVTKVTADVTSAVMGI PVTREFDVGRHIASGCNGLAWKI FNGTKKSTKQE 
VAVFVFDKKLIDKYQKFEKDQIIDSLKRGVQQLTRLRHPRLLTVQHPLEESRDCLAFCTE 
PVFASLANVLGNWENLPS P I S PDI KD YKLYDVETKYGLLQVS EGLS FLHS S VKMVHGN I T 
PENI ILNKSGAWKIMGFDFCVSSTNPSEQEPKFPCKEWDPNLPSLCLPNPEYLAPEYI LS 
VSCETASDMYSLGTVMYAVFNKGKPIFEVNKQDIYKSFSRQLDQLSRLGSSSLTNIPEEV 
REHVKLLLNVTPTVRPDADQMTKIPFFDDVGAVTLQYFDTLFQRDNLQKSQFFKGLPKVL 
PKLPKRVIVQRILPCLTSEFVNPDMVPFVLPNVLLIAEECTKEEYVKLILPELGPVFKQQ 
EPIQILLIFLQKMDLLLTKTPPDEIKNSVLPMVTRALEAPSIQIQELCLNI IPTFANLID 
YPSMKNALIPRI KNACYKHLPLRFV 

SEQ ID NO: 207_AA278842_H

MWFFARDPVRDFPFiLIPEPPEGGLPGPWALHRGRKKATGSPVSIFVYDVKPGAEEQTQV 

AKAAFKRFKTLRHPNI LAYI DGLETEKCLHWTEAVTPLG I Y LKAR VEAGG LKE LE I SWG 
LHQIVKALSFLVNDCSLIHNNVCMAAVFVDRAGEWKLGGLDYMYSAQGNGGGPPRKGIPE 
LEOYDPPELADSSGRWREKWSADMWRLGCLIWEVFNGPLPRAAALRNPGKI PKTLVPHY 
CELVGANPKVRPNPARFLQNCRAPGGFMSNRFVETNLFLEEIQIKEPAEKQKFFQELSKS 

LDAFPEDFCRHKVLPQLLTAFEFGNAGAWLTPLFKVGKFLSA3EYQQKI I PVWKMFSS 
TDRAMRIRLLQQMEQFIQYLDEPTWTQIFPtlVVKGFLDTNPAIREQTVKSML^PKLN 
EANLNVELMKHFARLQAKDEQGPIRCNTTVCLGKIGSYLSASTRHRVLTSAFSRATRDPF 

APSRVAGVLGFAATHNLYSMNDCAQKILPVLCGLTVDPEKSVRDQAFKAIRSFLSKLESV 

SEDPTQLEEVEKDVHAASSPGMGGAAASWAGWAVTGVSSLTSKLIRSHPTTAPTETNIPQ 
RPTPEGVPAPAPTPVPATPTTSGHWETQSEDKDTAEDSSTADRWDDEDWGSLEQEAESVL 

AQQDDWSTGGQVSRASQVSNSDHKSSKSPESDWSSWEAEGSWEOGWQEPSSQEPPPDGTR 

LASEYNWGGPESSDKGDPFATLSARPSTQPRPDSWGEDNWEGLETDSRQVKAELARKKRr. 

E RRRE MEAKRAE R KV AKG PM KLGAR KLD 
FIGURE 1V



SEQ ID NO: 208_AA599286_H

NlAFMEKPPAGK\ r LLDDTVFLTAAI EASQSLQSHTEYI I RVQGGI SYENSWQI VRRYSDFD 
LLNNSLQIAGLSLFLPPKKLIGK^DREFIAERQKGLQNYLNVITTNHILSNCELVKKFLD 
PNNYSANYTEIALQQVSMFFRSEPKWEWEPLKDIGWRIRKK^ 

ADLGPDKYLSDKDFQCLI KLLPSCLHPYI YRVTFATANESSALLI RMFNEKGTLKDLI YK 
APCPKDPFLKKYCNPKKIQGLELQQI KTYGRQILEVLKFLHDKGFPYGHLHASNVMLDGDT 
CRLLDLENSLLGLFSFYRSYFSQFRKINTLES\nDVHCFGKLiLYEMTYGRPPDSVPVDSFP 
PAPSMAWAVLESTLSCEACKNGMPTI SRLLQMPLFSDVLLTTSEKPQFKI PTKLKEALR 
IAKECIEKRLIEEQKQIHQHRRLTRAQSHHGSEEERKKRKILARKKSKRSALENSEEHSA 
KYSNSNNSAGSGASSPLTSPSSPTPPSTSGISALPPPPPPPPPPAAPLPPASTEAPAQLS 
SQAVNGMSRGALLSSIQNFQKGTLRKAKPVITVLRRSAEASCLHLEGKVLFYSYSPLPPN 
YPLPGKVI AEPVQPQTVLFCRCSCKQLFERNNSLSRI KLGWHAKKKKKK 

SEQ ID NO: 209_AA425725_H 

MSASTGGGGDSGGSGGSSSSSQASCGPESSGSELAIjATPVPQMLQGLLGSDDEEQEDPKD 
YCKGGYHPVKIGDVFNGRYHWRKLGWGHFSTVVJLCWDIQRK^FVALK^KSAGHYTETA 
VDEIKXiLKCVRDSDPSDPKRETI VQLIDDFRI SGVNGVHVCNTVLEVLGHQLLKWI I KSNY 
QGLPVPCVKSIWQVLHGLDYLHTKCKIIHTDIKPENILLCVGDAYIRRLAAEATEWQQA 
GAPPPSRSIVSTAPQEVLTGKLSKNKRKKMRRKRKQQKRLLE^^ 

EDSGLRLDGGSGSTSSSGFSGSLFSPASCSI LSGSSNQRETGGLLSPSTPFGASNLLVNP 
LEPQNADKIKIKIADLGNACV7VHKHFTEDIQTRQYRAVEVLIGAEYGPPADIWSTACMAF 
ELATGDYLFEPHSGEDYSRDEDHIAHI VELLGDI PPAFALSGRYSREFFNRRGELRHIHN 
LKHWGLYEVLMEKYEWPLEQATQFSAFLLPMMEYIPEKRASAADCLQHPWLNP 

SEQ ID NO: 210_SGK022_H 

MEDFLLSNGYQLGKTIGEGTYSKVKEAFSKKHQRKVAIKVIDKMGGPSEFIQRFLPRELQ 



I VRTLDHKNI IQVYEMLESADGKI CLVMELAEGGDVFDCVLNGGPLPESRAKALFRQMVE 
AIRYCHGCGVAHRDLKCENALLQGFNLKLTDFGFAKVL.PKSHRELSQTFCGSTAYAAPEV 
LQGIPHDSKKGDWSMGVVLYVMLCASLPFDDTDIPKMLWQQQKGVSFPTHLSISADCQD 
LLKRLLEPDMILRPSIEEVSWHPWLAST 

SEQ ID NO: 211_AA060026_M SGK022_M

MEDFLLSNGYQLGKTIGEGTYSKVKEAFSKKHQRKVAI KI IDKMGGPEEFIQRFLPRELQ 
IWTLDHKNIIQVYEMLESADGKIYLVMELAEGGDVFDC^ 

AIRYCHGCGVAHRDLKCENALLQGFNLKLTDFGFAKVLPKSRRELSQTFCGSTAYAAPEV 
LQGI PHDSKXGDVWSMGVVLYVMLCASLPFDDTDI PKMLWQQQKGVSFPTHLGI STECQD 
LLKRLLEPDMILRPSIEEVSWHPWLAST 

SEQ ID NO: 212_AA399669_H

MGKGDVLEAAPTTTAYHSLMDEYGYEVGKAIGHGSYGSVTELAFYTKQKVMVAVKI ISKKK 
ASDDYLNKFLPREIQQVMKVLRHKYLINFYRAIESTSRVYIILELAQGGDVLEWIQRYGA 
CSEPLAGKWFSQLTLGIAYLHSKSI VHRDLKLENLLLDKWENVKI SDFGFAKMVPSNQPV 
GCSPXYRQVNCFSHLSQTYCGSFAYACPEILRGLPYNPFLSDTWSMGVILYTLWAHLPF 
DDTNLKKLLRETQKEVTFPANHTI SQECKVQLLI ACVAQWRKTQARPLSPLL 

SEQ ID NO: 213_AA758539_K

MDDATVLRKKGY I VGINLGKGSYAKVKSAYSERLKFNVAVKI IDRRKTPTDFVERFLPRE 
MDILATVNHGSI IKTYEI FETSDGRIYI IMELGVQGDLLEFI KCQGALHEDVARKMFRQL 
SSAVKYCHDLDI VHRDLP:CENLLLDKDFNIKLSDFGFSKRCLRDSNGRI I LSKTFCGSAA 
FIGURE 1W

YAAPEVLQS I PYQPKVYDI WSLGVILYIMVCGSMPYDDSDIRKMLRIQKEHRVTDFPRSKN 
LTCECKDLI YRMLQPDVSQRLHIDEILSHSWLQPPKPKATSSASFKREGEGKYRAECKLD 
TKTGLRPDHRPDHKLGAKTQHRLLVVPENENRMEDRLAETSRAKDHKISGAEVGKAST 

SEQ ID NO: 214_AA883975_H

MSGDKLLSELGYKLGRTIGEGSYSKVKVATSKKY KGTVAIKVVDRRRAPPDFVNKFLPRE 
LS ILRGVRHPHI VHVFEFIEVCNGKLYIVMEAAATDLLQAVQRNGRI PGVQARDLFAQ1 A 
GAVRYLHDHHLVHRDLKCENVLLSPDERRVKLTDFGFGRQAHGYPDLSTTYCGSAAYASP 
EVLLGI PYDPKKYDVWSMGVVTjYVNTVrTGCMPFDDSDIAGLPRRQKRGVLYPEGLELSERC 
KALIAELLQFSPSARPSAGQVARNCWLRAGDSG 

SEQ ID NO: 215_AA905446_H

VGRQETGVRRWAFLICQPISPPLTSSEFIQRFLPRELQIVRTLDHKNI IQVYEMLESADG 
KICLVMELAEGGDVFDCVL^GGPLPESRAKALFRQ^EAIRYCHGCGVAHRDLKCENALL 
QGFNLKLTDFGFAKVLPKSHRELSQTFCGSTAYAAPEVLQGIPXKMLWQQQKGVSFPTHL 
S I S ADCQDLLKRLLEPDMI LRPS I EEVS WHPWLAST 

SEQ ID NO: 216_H29974_H 

YS LLAE I GRGS YG WYEAVAGRS GAR VAVKKI RCDAPENVELALAE FWALT S LKRRHQNV 
VQFEECVLQRNGLAQRMSHGNKSSQLYLRLVSTSLKGERILGYAEEPCYLWFVMEFCEGG 
DLNQYVLSRRPDPATNKS FMLQLTSAI AFLHKNHI VHRDLKPDNI LI TERSGTP I LKVAD 
FGLS KVCAGLAPRGKEGNQDNKNVNVNKYVfLS SACGSDFYMAPEVWEGK YTAKAD I FALG 
IIIWAMIERITFIDSETKKELLGTYIKQGTEIVPVGEALLENPKMELHIPQKRRTSMSEG 
I KQLLKDMLiAANPQDRPDAFELETRMDQVTCAA 

SEQ ID NO: 217_AA498104_M H29974_M

PLLLPPPPAAMETGKENGARRGTKSPERKRRSPVQRVLCEKLRPAAQAMDPAGAEVPGEA 
FLARRRPDGGGGDVPARPRYSLLAEIGRGSYGVVYEAVAGRSGARVAVKKIRCDAPENVE 
IJ^JLiAEFWALTSLKRRiiQNIVQF 

GY/^EPCYLWFVMEYCEGGDI^QYVLSRRPDPATNKS FMLQLTSAI AFLHKNH I VHRDLK 
PDNI LI TERSGTP I LKVADFGLSKVCAGLAPRGKEGNQDNKNVNVNKYWLSSACGSDFYM 
APEVWEGHYTAKADIFALGIIIWAMIERITFIDSETKKELLGTYIKQGTEIVPVGEALLE 
NPKMELHI PQKJ^RTSMSEGVKQLLKDMLAANPQDRPDAFELETRMDQVTCAA 

SEQ ID NO: 218_AA215311_H

MVSSQPKYDLIREVGRGSYGWYEAVIRKTSARVAVKKIRCHAPENVELALREFWALSSI 
KSQHPNVIHLEECILQKDGMVQKMSHGSNSSLYLQLVETSLKGEIAFDPRSAYYLWFVMD 
FCDGGDMNEYLLSRKPNRKTNTSFMLQLSSALAFLHKNQIIHRDLKPDNILISQTRLDTS 
DLEPTLKVADFGLSKVCSASGQNPEEPVSVNKCFLSTACGTDFYMAPEVWEGHYTAKADI 
FALGIIIWAMLERITFIDTETKKELLGSYVKQGTEIVPVGEALLENPKMELLIPVKKKSM 
NGRMKQLIKEMLAANPQDRPDAFELELRLVQIAFKDSSWET 

SEQ ID NO: 219_AA018361_H

KRAAFPAGGAGGSVEPPSARPAPQPAGTAARSEEAPARAQAAGMAGPGWGPPRLDGFILT 

ERLGSGTYATVYKAYAKKDTREWA I KCVAKKS LNKAS VENLLTE I E I LKG IRHPHI VQL 
KDFQWDSDNIYLIMEFCAGGDLSRFIHTRRILPEKVARVFMQQLASALQFLHERNISHLD 
LKPQNI LLS SLEKPHLKLADFGFAQHMS PWDEKHVLRGS PLYMAPEMVCQRQYDARVDLW 
SMGVILYEALFGQPPFASRSFSELEEKIRSNRVIELPLRPLLSRDCRDLLQRLLSRDPSR 
RISFQDFFAHPWVDLEHMPSGESLGRATALWQAVKKDQEGDSAAALSLYCKALDFFVPA 



FIGURE 1X

LHYE\HDAQRKEAI KAKVGQYVSRAEELKA. I VS SSNQALLRQGTS ARDLLREMARDKFRLL 
AALEVASAA>lAKEEAAGGEQDAliDLY0HSLGELLLLLRSPRAGGGSCFTLRFRT3WFELI>: 



SEQ ID NO: 220_AA311714_H

MENFI LYEEIGRGSKTVVYKGRRKGTINFVAI LCTDKCRRPEI TNWVRLTREI KHKNI VT 
FHEWYETSNHLWLVXENLPEDWREFGIDLI SGLHHLHKLGI LFCDI S PRKI LLEGPGTL 
KFSNFCIJVKVEGENLEEFFALVAAEEGGGDNGElSr^KKSMKSRVKGSPVYTAPEVVRGAD 
FSI SSDLWSLGCLLYEMFSGKPPFFSESVSELTEKILCEDPLPPI PKDSSRPKASSDFIN 
LLJDGLLQRDPQKJILTWTRLLQHSFWKKAFAGADQESSVEDLSLSRNTMECSGPQDSKELL 
QNSQSRQAKGHKSGQPLGHSFRLENPTEFRPKSTLEGQLNESMFLLSSRPTPRTSTAVEV 
SPGEDMTHCSPQKTSPLTKITSGHLSQQDLESQMRELI YTDSDLWTPI IDNPKIMKQPP 
VKFDAKILHLPTYSVDKLLFLKDQDWNDFL^ 

AGHQEVATRLLHSPLFQLLIQHLRIAP^DIRAKVAHVIGLl^SHTTELQEOTPVVETTS 
S I G I G I LNCLVQHSTPVPRQCLVYV 

SEQ ID NO: 221_SGK384_H 

SLiAHVLRARQILTEPEVRDYLRGLVSGLRYLHQRCILHR 
SEQ ID NO: 222_AA210451_M SGK384_M

MGQQHGTRNGLTHRELPRGVGLLLAJ4ALMNVALYLCLDQLFISPGRSTADSRR 
MGRMR2^GSRWLSCEELRTEVRQLKRVGEGAVKRVF 

GLQMLKSLQSEHWTLVGYCEEDGTILTEYHPLGSLSNLEETLNLSKYQDVNTWQHRLQL 
AMEWSIINYLHHSPLGTRVMGDSNDLPKTLSQYLLTSNFSIVANDLDALPLVDHDSGVL 
IKCGHRELHGDFVAPEQLWPYGEDTPFQDDIJ^PSYNEKVDIWKIPDVSSFLLGHVEGSDM 
VRFHLFD I HKACKSQ I PAERPTAQNVLDAYQRVFHS LRDTVMSQTKEML 

RWPrVY^in <: AAO Of; i 'M>\i tf 

SEQ ID NO: 223_SGK071_2_H

EWAVQMMVECMDDHYASQALEELMPLLKLRHAHISVYQELFITWNGEISSLYLCLVMEF 
NELS FQEVI EDKRKAKKI I DSEWMQNVLGQVLDALE YLHHLD 1 1 HRNLKPSN 1 1 L I S SDH 
CKLQDLSSNVLMTDKAKVTO ILDMTSC 
SFMDGTEAMHLRKSLRQSPGSLKAVLKTMEEKQI PDVETFRNLLPLMLQIDPSDRITI KD 
VVHITFLRGSFK^SC^SLTLHRQMVPASITDKLLEGNVASILGDAGDTKGERALKXLS^ 
LiASYCLVPEGSLFMPLALLHMHDQWLSCDQDRVPGKRIDFASLiGKLGKLLGPI PKGLPWPP 
ELVEVVVTTMELHDRVLDVQLCACSLLLHL^ 

PEEEPLLVMVTSLLAITTTQESESLSEELQNAGLLEHILEHLNSSLESRDVCASGLGLLW 
ALLLDDPILALQRPRKKRAPNHGKPGKPKNPASTQS I IVNKAPLEKVPDLI SQVLATYPA 
DGEMAEASCGVFWLLSLLGCIK^QQFEQWALLLQSIRLCQDRAL^^ 
SELAAFKVWQEEGGSGLSLIK£TYQLHRI)DPEWE 
KALLQEIKERFTSSLVSDSSAFSKPGLPPGGSPQLGCTTSGGLE 

SEQ ID NO: 224_AA118352_M SGK071_M

EEDPCQKSWMAPEALKFSFSTKSDIWSLGCIILDMATCSFLNDTEAMQLRKAIRHHPGSL 
KPI LKTMEEKQI PGTDVYYLLLPFMLHINPSDRLAI KIDVMQVTFHSNSFKSSSVALNMQR 
QKVPIFITDVLLEGNMANILGSWLCASFVNDSRHCDSGIGSQRLGFDFQSVSWTEHPLKD 
VMQNFSSRPEVQLRAINKLLTMPEDQLGLPWPTELLEEVI S 1 1 KQHGRI LDI LLSTCSLL 
LRVLGQALlAKDPEAE I PRSSLI I SFLMDTLRSHPNSERLVKWVYNVLAI I SSQGQI SEEL 
EEEGLFQLAQENLEHFQEDRDICLSILSLLWSLLVDVVTVDKEPLEQLSGMVTWVLATHP 
EDVEIAEAGCAVLWLLSLLGCUCESQFEQVWI.LLRSIQLCPGRVLLVNNAFRGLASLAK 
FIGURE 1Y

VSELVAFRIWLEEGSSGLHLIQDIYKLYKDDPEVVENLCMLLAHLTSYKEILPEMESGG 
IKDLVQVIRGRFTSSLELISYADEILQVLEANAQPGLQEDQLEPPAGQEAPLQGEPLFRP 

SEQ ID NO: 225_018653.9_H

GRGRGAGHARGLGRGPAGRRAEPPRSLSRPGPGPGSRAGPAGRGEGSDAAPAGGSGRGFL 

RLLPAGLRPQRALRSGSEPPRPGQSPEPSPAPGAGRRGGRGELARQIRARYEEVQRYSRG 

GPGPGAGRPERRRLMDLAPGGPGLPRPRPPWARPLSDGAPGWPPAPGPGSPGPGPRLGCA 

ALRNVSGAQYMGSGYTKA\^RVRLPGGAAVALKAWFSGHDLGSGVREFGVRRGCYRLAA 

HKLLKEMVLLERLRHPNVLQLYGYCYQDSEDIPDTLTTITELGAPVEMIQLLQTSWEDRF 

RICLSLGRLLHHLAHSPLGSVTLLDFRPRQFVLVDGELKVTDLDDARVEETPCAGSTDCI 

LEFPARNFTLPCSAQGWCEGMNEKRNLYNAYRFFFTYLLPHSAPPSLRPLLDSIVNATGE 

LAWGVDETLAQLEKVLHLYRSGQYLQNSTASSSTEYQCIPDSTIPQEDYRCWPSYHHGSC 
LLSVFNLxAKAVDVCESHAQCRAFVVHTN^ 

SEQ ID NO: 226_AA396601_M

TRPGCAALRNVSGAQYVGSGYTKAWRVRLPGGAAVALKAVDFSGHDLGSCVREFGARRG 

CYRLAAHKLLKE:^^LERLRHPNVLQLYGYCYQDSEGIPDTLTTITELGAPVEM 

SWEDRFRICLSLGRLLHHLAHSPLGSVTLLDFRPRQFVLVNGELKVTDLDDARVEETPCT 

SSADCTLEFPARNFSLPCSAQGWCEGMNEKRNLYNAYRFFFTYLLPHSAPPSLRPLLDSI 

WATGEIAWGVDETLAQLETALHLFRSGQYLQNSTSSRAEYQRIPDSAITQEDYRCWPSY 

HHGGCLLSVFNIoAEAIDVCESHAQCRAFVVTNQTTWTGRKLVFFKTGWNQWPDAGKTTY 
VKAPG 

SEQ ID NO: 227_VRK3_H

MISFCPDCGKSIQAAFKFCPYCGNSLPVEEHVGSQTFVNPPiVSSFQGSKRGLNSSFETSP 

KKVICWSSTVTSPRLSLFSDGDSSESEDTLSSSERSKGSGSRPPTPKSSPQKTRKSPQVTR 

GSPQKTSCSPQKTRQSPQTLKRSRVTTSLEALPTGTVLTDKSGRQWKLKSFQTRDNQGIL 

YEAAPTSTLTCDSGPQKQKFSLKLDAKDGRLFNEQNFFQRAAKPLQVNKWKKLYSTPLLA 

IPTCMGFGVHQDKYRFLVLPSLGRSLQSALDVSPKHVLSERSVLQVACRLLDALEFLHEN 

EYVHGNWAENIFVDPEDQSQVTLAGYGFAFRYCPSGKH\^AYVEGSRSPHEGDLEFISMD 

LHKGCGPSRRSDLQSLGYCMLKWLYGFLPWTNCLPNTEDIMKQKQKFVDKPGPFVGPCGH 
WIRPSETLQKYLKWMAI/TYEEKPPYAML^ 

SEQ ID NO: 228_S71575_M VRK3_M

IPTCIGFGIHQDKYRFLVFPSLGRSLQSALDDNPKHWSERCVLQVACRLLDALEYLHEN 
EYVHGNLTAENVFVNPEDLSQVTLVGYGFTYRYCPGGKHVAYKEGSRSPHDGDLEFISMD 
LHKGCGPSRRSDLQTLG YCMLKWLYGSLPWTNCLPNTEKI TRQKQKYLDS PERLVGLGGR 
WNKASETLREYLKWMALNYEEKPPY 

SEQ ID NO: 229_AA45427_H

MGHALCVCSRGTVI IDNKRYLFIQKLGEGGFS YVDLVEGLHDGHFYALKRIL.CHEQQDRE 

EAQREADMHRLFNHPNILRLVAYCLRERGAKHEAWLLLPFFKRGTLWNEIERLPCDKGNFL 

TEDQILWLLLGICRGLEAIHAKGYAHRDLKPTNILLGDEGQPVL^LGSMNQACIHVEGS 

RQALTLQDWAAQRCTISYRAPELFSVQSHCVIDERTDVWSLGCVLYAMMFGEGPYDMVFQ 

KGDSVAIAVQNQLSIPQSPRHSSALRQLLNSMMTVDPHQRPHIPLLLSQLEALQPPAPGQ 
HTTQI 

SEQ ID NO: 230_H05721_H 

MAVRQALGRGLQLGRALLLRFTGKPGRAYGLGRPGPAAGCVRGERPGWAAGPGAEPRRVG 
LGLPNRLRFFRQSVAGI^ARLQRQFVVRAWGCAGPCGRAVFLAFGLGLGLIEEKQAESRR 
FIGURE 1Z



avsacqeiqai ftqks kfgpdfldtrrlqgfrleeyli gqs i gkgcsaavyeatmptlp" 
nlevtkstgllpgrgfg73apgegqerapgapafplai kmmwni sags s sea! lntmsqe 
lvpasrvalageyga\tyrkskrgpkqlaphpni i rvlrafts s vpllpgalvdy pdvlp 
srlhpeglghgrtlflw1knypctlrqylc\^ t tpspriwu^mllqllegvdhlvqqgiai: 
rdlksdni lv^ldpdgcpwlvi adfggclades iglqlpfsswyvdrggngclmapevst 
arpgpravidyskadawavgai ayei fglvnpfygqgkahlesrs yqeaqlpalpes vpp 
dvrqlvrallqreaskj^psai^vaanvlhl^ 
anrltekccvetkmkmlflanlecetlcqaalllcswraal 

SEQ ID NO: 231_AI086865_H

mekyerirwgrgafgiwlclrkadqklviikqipvt:qmtkeerqaaqnecqvlkll^ 
pnvieyyenfledkaij^iameyapggtlaefiqkrcnsll^ 

thlilhrdlktqnilldkhrmvvkigdfgi skilsskstpcyi spelcegkpynqksdi w 
algcvlyelxaslk^vfeaanlpalvlkimsgtfapisdryspelrqlvlsllslepaqrp 
plshimaqplcirallnlhtdgrevrgpqqhreqdhqcplqrgiimtfgsgsngclghgs 
ltdi sqpti veajl.lgyekvqqveealsftllgsapldqeplls idlgtahsaavtgeedl 
gsgdvnrlipswerghllagvasstdvstfsegdckepdkccwrhkqctghi i ypfasdcv 
rhslhlhsvnhcngnsrlkdssedssssrgagptcskvies pgfeltpeeehverfrygw 
cksyrpvsvavihhplykecgaddlnxkkrkrrrrkskppiptqvgpataspdlgtsmat 
gtpdstapitiwrsesptgkgqgskvikkvkkkkek^ 

kks pvkxeps ppdvsrslsarqlarmsess pesreeleseds yngrgqgelssedi vess 
sprkrentvqakktgakpsqarkvnkrksppgsnpnls 

SEQ ID NO: 232_AA836348_H

msvlgeyerhcds insdfgsesggcgdsspgpsasqgpragggaaeqeelhyi pi rvlgr 

gafgeatlyrrteddslwwkevdltrlsekerrdalneivilallqhdni iayynhfmd 

nttllieleycnggnlydkilrqkdklfeeemvvwylfqivsavscihkagilhrdiktl 

nffltkanliklgdyglakklnseysmaetlvgtpyymspelcqgvkynfksdiwa 

ifelltlkrtfdatnplnlcvkivqgiramevdssqysleliqmvhscldqdpeqrptad 

elldrpllrkjirrsstvteapiavwsrtsevywgggkstpqkldviksgcsarqvcag 

nthfawtvekelytwnmqggtklh 

tvcvtdegqlyafgsdyygcmgvdk^agpevlepmqlnfflsnpveqvsggdnhvvvltr 
nk^vyswgcgeygrlgldseedyytpqkvdvpkali i vavqcgcdgtflltqsgkvliagg 
lnefnkl.glnqgmsgi inheayhevpyttsftlakqlsfykirti apgkthtaaidergr 
lltfgcnkcgqlgvgnykkrlginllggplggkqvirvscgdeftiaatdekvlnsktir 
snssglsigtvfqssspgggggggggeeedsqqesetpdpsggfrgtmeadrgmegli sp 
teamgnsngassscpgwlrkelenaefipmpdspsplsaafsesekdtlpyeelqglkva 
seaplehkpqveasvtelfafesqlvtsaescsnlcwegnttdsscvcvqlsaggg 

SEQ ID NO: 233_R86668_H, MKK6_H

mnlllsyrdvqdysaiielvetlqalptcdvaeqhnvcfhytfalnrrnrpgdrakalsv 
llplvqlegsvapdlycmcgriykdmffssgfqdaghreqayhwyrkafdvepslhsgin 
aavlli aagqkfedskelrligmklgcllar kgcvekmqyywdvgfylgaq ilandptqv 
vlaaeqlyklnapiwylvs^etfllyqhfrptpeppggpprrahfwlhfllqscqpfkt 
acaqgdqclvlvlemnkvi-lpaklevrgtdpvstvtlsllepetqdi psswtfpvas i cg 
vsaskrderccflyalppaqdvqlcfpsvgkcqwfcgliqawvtnpdstapaeeaegage 
mlefdyeytetgerlvlgkgtygwyagrdrhtrvriai kei perdsrfsqplheeialh 
rrlrhkni vryl.gsasqggylki fmeevpggslssllrsvwgplkdnesti sfytrqi lq 
glgylhdnhivhrdikgdnvlintfsgllkisdfgtskrlagitpctetftgtlqymape 
1 1 dqgprgygkaadi wslgctviekatgrppfhelgspqaamfqvgmykvhppmpsslsa 
FIGURE 1AA



EAQAFLLRTFEPDPRLRASAQTLLGDPFLQPGKRSRS PSS PRHAPRPSDAPSAS FTPS AN 

STTQSQTFPCPQAPSQHPPSPPKRCLSYGGTSQLRVPEEPAAEEPASPEESSGLSLLHO^ 
SK^RAMLAAVLEQELPALAENLHQEQKQEQGARLGRN^^ 

ELRALQGRLiRAQGLGPALLHRPLFAFPDAVKQ I LRKRQI RPHWMFVLDSLLSRAVRAALG 
VLGPEVEKEAVSPRSEELSNEGDSQQSPGQQSPLPVEPEQGPAPLMVQLSLLRAETDRLR 
EIIAGKEREYQALVQRALQRLNEEAJ^TYVI^PEPPTALSTDQGLVQWLQELNVDSGTI QM 
LLNHS FTLHTLLTYATRDDLI YTRIRGGMVCR I WRAI LAQRAGSTPVTSGP 

SEQ ID NO: 234_PAK6_H

MFGKKKKKIEI SGPSNFEHRVHTGFDPQEQKFTGLPQQWHSLLADTANRPKPMVDPSCIT 
PIQLAPMKTIVRGNKPGKETSINGLLEDFDNI SVTRSNSLRKESPPTPDQGASSKGPGHA 
EENGFITFSQYSSESDTTAI)YTTEKYREKSLYGDDLDPYYRGSHAAKQNGHVMKMKHGEA 
YYSEVKPLKSDFARFSADYHSHLDSLSKPSEYSDLKWEYQRASSSSPLDYSFQFTPSRTA 
GTSGCSKESLAYSESEWGPSLDDYDRRPKSSYLNQTSPQPTMRQRSRSGSGLQEPMMPFG 
ASAFKTHPQGHSYNSYTYPRLSEPTMGIPKVDYDRAQMVLSPPLSGSDTYPRGPAKLPQS 
QSKSGYSSSSHQYPSGYHKATLYHHPSLQSSSQYISTASYLSSLSLSSSTYPPPSWGSSS 
DQQPSRVSHEQFRAALQLWSPGDPREYIJ^FIKIGEGSTGIVGIATEKHTGKQVAVKKr^ 
DLRKQQRRELLFNEWIMRDYHHDNVVDMYSSYLVGDELVm^EFLEGGALTDIVTHTRM 
NEEQI ATVCLSVLRALSYLHNQGVIHRDI KSDS I LLTSDGRI KLSDFGFCAQVSKEVPKR 
KSLVGTPYWMAPEVISRLPYGTEVDIWSLGIIWIEMIDGEPPYFNEPPLQAMRRIRDSLP 
PRVKDLHKVSSVLRGFLDLMLVREPSQRATAQELLGHPFLKLiAGPPSCI VPLMRQYRHH 

SEQ ID NO: 235_SURTK106_H

MNDRNE I QMEAKLQSLT 1 1 AQE I LCRFF I TLRRHARFLLTKLGRQGMARSG I THSCAVC I 
LCGPSREGDSPVANGMTRMLLECSLSDKLCVIQEKQYEVI I VPTLLVTI FLI LLGVILWL 
FIREQRTQQQRSGPQGIAPVPPPRDLSWEAGHGGNVALPLKETSVENFLGATTPALAKLQ 
VPREQLSEVLEQICSGSCGPIFRANMNTGDPSKPKSVILKALPCEPAGLHEVQDFLGRIQF 
HQYLGKHKNLVQLEGCCTEKLPLYMVLEDVAQGDLLGFLWTCRRDVMTMDGLLYDLTEKQ 
VTHIGKQVLLA^EFLQEKIILFHGDVAARNILMQSDLTAKLCGLGIAYEVYTRGAISSTQT 
IPLKWLAPERLLLRPASIRADWSFGILLYEIWTLGAPPYPEVPPTSILEHLQRRKIMKR 
PSSCTHTMYSIMKSCWRWREADRPSPRELRLRLEAAIKTADDEAVLQVPELVVPELYAAV 
AGIRVESL.FYNYSML 

SEQ ID NO: 236_AA098024_M
LQEKHLFHGDVAAIINILIQSDLTPKX.GHLG 

LRPASIRGDIWSFGILLYEIWTLGAPPYPEVPPTSILQYLQRKKIMKRPSSCSHAMYNIM 
KCCWRWSEDSRPLLVQLLQRLEAASRSADDKAVLQVPELVVPELYADVAGIRAESISYSF 

SVL 

SEQ ID NO: 237_SGK2ALPHA_K

MNSSPAGTPSPQPSRANGNINLGPSANPNAQPTDFDFLKVIGKGNYGKVLLiAKRKSDGAF 
YAVKVLQKKSILKKKEQSHIMAERSVLLKNVRHPFLVGLRYSFQTPEKLYFVLDYVNGGE 
LFFHLQRERRFLEPRARFYAAEVASAI GYLiHSLNI I YRDLKPENI LLDCQGHVVLTDFGL 
CKEGVEPEDTTSTFCGTPEYLAPEVLRKEPYDRAVDWWCLGAVLYEMLHGLPPFYSQDVS 
QMYENILHQPLQI PGGRTVAACDLLQSLLKKDQRQRLGSKADFLEI KNHVFFSPINWDDL 
YHKRLTPPFNPNVTGPADLKHFDPEFTQEAVSKSIGCTPDTVASSSGASSAFLGFSYAPE 

DDDILDC 
FIGURE 1BB

SEQ ID NO: 238_CCRK_H

MDQYC I LGR I GEGAHG I YFKAKHVETGE 1 1 ALKKVALRRLEDGF PNQALRE I KALQEKED 
NQYWQLKAVFPHGGGFVIJ^FEFMLSDI^EVVRHAQRPI^^ 

NNI VHRDLKPANLLI SASGQLKI ADFGLARVFSPDGSRLYTHQVATRS VGCIMGELLNGS 
PLFPGKNDIEQLCYVLRILGTPNPQVWPELTELPDYNKI SFKEQVPMPLEEVLPDVSPQA 
LDLLGQFLLYPPHQRI AASKALLHQYFFTAPLPAHPSELPI PQRLGGPAPKAHPGPPHI H 
DFHVDRPLEGVAVEPRADSALHPGGVRSWPWSRLPAPQDHSVHLFLCHLPGFTLQGLPMA 
TVGPHHTLPLSPCEGWSRGRGHVPSQEYENIQSSRGDSWPVLGEPYLLCATDVPIRTVSS 
AASQGLHMQNDDACLGAASPECCLLVKEKCRE 

SEQ ID NO: 239_TESK2_H

MDRSKRNS I AGFPPRVERLEEFEGGGGGEGNVSQVGRVWPSS YRALI SAFSRLTRLDDFT 
CEKIGSGFFSEVFKVRHRASGQVMALKMNTLS 

NLEQLLDSNLHLPWTVRVKLAYDIAVGLSYLHFKGIFHRDLTSKNCLIKRDENGYSAVVA 
DFGLAEKIPDVSMGSEKiAVVGSPFWMAPEVLRDEPYNEKADVFSYGIILCEI IARIQAD 
PDYLPRTENFGLDYDAFQHMVGDCPPDFLQLTFNCCNMDPKLRPSFVEIGKTLEEILSRL 
QEEEQERDRKLQPTARGLLEKAPGVKRLSSLDDKI PHKS PCPRRTI WLSRSQSDI FSRKP 
PRTVSVLDPYYRPRDGAARTPKVNPFSARQDLMGGKIKFFDLPSKSVISLVFDLDAPGPG 
TMPLiADWQEPIoAPPIRRWRSLPGSPEFLHQEACPFVGREESLSDGPPPRLSSLKYRVKEI 
PPFRASALPAAQAHEAMDCSI LQEENGFGSRPQGTSPCPAGASEEMEVEERPAGSTPATF 

STSG IGLQTQGKQDG 
FIGURE 2A

SEQ ID NO: 1_X69117_H BARK2_H 

ATGGCGGACCTGGAGGCCGTGCTGGCCGATGTCAGTTACCTGATGGCCATGGAGAAGAGC 

AAGGCGACCCCGGCCGCCCGCGCCAGCAAGAGGATCGTCCTGCCGGAGCCCAGTATCCGG 

AGTGTGATGCAGAAGTACCTTGCAGAGAGAAATGAAATAACCTTTGACAAGATTTTCAAT 

CAGAAAATTGGTTTCTTGCTATTTAAAGATTTTTGTTTGAATGAAATTAATGAAGCTGTA 

CCTCAGGTGAAGTTTTATGAAGAGATAAAGGAATATGAAAAACTTGATAATGAGGAAGAC 

CGGCTTTGCAGAAGTGGACAAATTTATGATGCGTACATGATGAAGGAAGTTGTTTGGTGT 

TCACATCCTTTCTCAAAGGAAGCTGTAGAACAGGTACAAAGTGATTTATCGAAGAAACAA 

GTGACATGAACTCTTTTTCAGGCATACATAGAAGAAATTTGTGAAAGGCTTCGAGGTGAC 

ATTTTTCAAAAATTTATGGAAAGTGACAAGTTCACTAGATTTTGTCAGTGGAAAAACGTT 

GAATTAAATATCCATTTGACCATGAATGAGTTCAGTGTGCATAGGATTATTGGACGAGGA 

GGATTCGGGGAAGTTTATGGTTGCAGGAAAGCAGACACTGGAAAAATGTATGCAATGAAA 

TGCTTAGATAAGAAGAGGATGAAAATGTU^CAAGGAGAAACATTAGCTTTAAATGAAAGA 

ATCATGTTGTCTCTTGTCAGCACAGGAGACTGTCCTTTCATTGTATGTATGACCTATGCC 

TTCCATACCCCAGATAAACTCTGGTTCATGCTGGATCTGATGAACGGGGGCGATTTGCAG 

TACCACCTTTCACAACACGGTGTGTTCTCTGAGAAGGAGATGCGGTTTTATGCCACTGAA 

ATCATTCTGGGTCTGGAACACGTGCACAATCGGTTTGTTGTCTACAGAGATTTGAAGCCA 

GCAAATATTCTCTTGGATGAACATGGACACGCAAGAATATCAGATCTTGGTCTTGCCTGC 

GATTTTTCCAAAAAGAAGCGTCATGCGAGTGTTGGCACCCATGGGTACATGGGTCCCGAG 

GTGCTGCAGAAGGGGACGGCCTATGACAGCAGTGCCGACTGGTTGTCCGTGGGCTGCATG 

CTTTTCAAACTTCTGAGAGGTCACAGCCCTTTCAGACAACATAAAAGCAAAGACAAGCAT 

GAAATTGACCGAATGACACTCACCGTGAATGTGGAAGTTGCAGAGACGTTGTGTCCTGAA 

CTGAAGTCCCTTTTGGAGGGCTTGCTTCAGCGAGACGTTAGCAAGCGGGTGGGGTGTCAG 

GGAGGCGGCTCACAGGAAGTAAAAGAGCACAGCTTTTTCAAAGGTGTTGACTGGCAGCAT 

GTCTACTTACAAAAGTACCCACCACCCTTGATTCCTCCGCGGGGAGAAGTCAATGCTGCT 

GATGCCTTTGATATTGGCTCATTTGATGAAGAGGATACCAAAGGGATTAAGCTACTTGAT 

TGCGACCAAGAACTCTACAAGAACTTCCCTTTGGTCATGTCTGAAGGCTGGCAGCAAGAA 

GTAACGGAAACAGTTTATGAAGCAGTAAATGCAGACACAGATAAAATCGAGGCCAGGAAG 

AGAGCTAAAAATAAGCAACTTGGCCAGGAAGAAGATTACGGTCTGGGGAAGGACTGTATT 

ATGCACGGGTACATGCTGAAACTGGGAAAGCCATTTGTGACTCAGTGGCAGCGTCGCTAT 

TTTTACCTCTTTCCAAATAGACTTGAATGGAGAGGAGAGGGAGAGTCCCGGCAAAATTTA 

CTGACAATGGAACAGATTCTCTCTGTGGAAGAAACTCAAATTAAAGACAAAAAATGCATT 

TTGTTCAGAATAAAAGGAGGGAAACAATTTGTCTTGCAATGTGAGAGTGATCCAGAGTTT 

GTGCAGTGGAAGAAAGAGTTGAACGAAACCTTCAAGGAGGCCCAGCGGCTATTGCGTCGT 

GCCCCGAAGTTCCTCAACAAACCTCGGTCAGGTACTGTGGAGCTCCCAAAGCCATCCCTC

TGTCACAGGAACAGCAACGGCCTCTGA 

SEQ ID NO: 2_AA144574_M BARK2_M

CTGCTTCGTAGTCTACAGAGACCTGAAGGCTGCGAACATCCTCCTAGATGAATATGGGCA 
CGTGAGGATATCGGATGTCGGCCTTGCGTGTGATTTCTCCAAAAAGAAGGCTCATGCGAG 
GGTGGGGACGCATGGGTACATGGCTCCCGAGGTGTTGCAGAAGGGAAGGTGCTATGACAG 
CAGCGGCGAGTGGTTGTCCGTGGGCTGTATGCTCTTCAAAGTTGTGCGGGGCCACAGCCC 
GTTCAGGCAGCATAAAACCAAAGACAAGCATGAGATAGAGCGAATGAGGGTGAGCGTGAA 
CGTGCAGGTTCCAGATGCGTTCTCGCGTGAGGTGAGGTCGGTGTTAGAGGGTTTGCTCGA 
GCGGGACGTGAGCCAGGGGCTGGGCTGGGGAGGAGGAGGGGGACGAGAGTTGAAGGAGCA 
CATCTTGTTCAAGGGCATTGAGTGGCAGCATGTGTAGTTACGGAAGTAGCGGCGACCCGT 
AATCCCTCCTCGGGGAGAGGTCAACGCTGCAGATGCCTTCGATATCGGCTCCTTCGATGA 
GGAAGACACCAAAGGCATTAAGCTGTTGGACTGTGACCAGGACCTCTATAAGAACTTCCC 
ACTGGTGATGTCCGAGGGGTGGCAGGAAGAAGTGGTGGAGAGCATGTATGACGCCGTCAA 
TGCTGATACTGATAAAATGGAGGCCAGGAAGAAGGGTAAAAATAAGCAAGTTGGTGAAGA 
FIGURE 2B

GGAAGATTACGCTATGGGGAAGGACTGCATCATGCACGGGTACATGCTGAAGCTGGGGAA
CCCCTTTCTCACACAGTGGCAAAGACGCTATTTTTACCTGTTCCCCAACAGACTGGAGTG 
GAGAGGAGAGGGCGAGTCTGGGGAAAGTCTACTGACCATGGAAGAGATGATGTCTGTGGA
GGAGACCCAGATTAAAGAGAGAAAGTGCATCTTACTCAGGATAAAGGGAGGGAAGCAATT
TGTGTTGCAATGTGAGAGTGACCCCGAGTTTGCACAGTGGCTGAAGGAGCTGACCTGGAC 
CTTCAATGAGGCCCAGAGACTGCTGCGCCGTGCCCCCAAATTCCTCAACAAACCACGGGC 
CGCCATCCTGGAGTTCTCCAAGCCACCACTGTGTCACAGAAATAGCAGCGGCCTCTGAAC 
CACAGAGCAGCGGGGCCTGAAGGAGGGGCGCCAGGTGTTCAGCCGAGGAGTGGAACGAAG 
CCACGGGGAACGGTGTGGGGGTAAGACACAGTGTTTCTGAGCACTGACGGGGCTGCTCGA 
AGCCGAGGAGGCTCAGGACACCAGGGCGGCCTTCTGGGAGCTGGGACATCCTCGGGGCTG 
TCGTATCCACACTCGAAATTACTGAAGAAGCAGAGGCATTCTGGTGTG 

SEQ ID NO: 3_AA826850_H 

GAAGAGGATGGGCTCGTCCATGTCGGGGGGCACCGCGCGGAGGCGGGTGTTTGAGGACAA 
GGAGGAGGTGAACTTCGACCACTTCCAGATCCTTCGGGCCATTGGGAAGGGCAGCTTTGG 
CAAGGTGTGCATTGTGCAGAAGCGGGACACGGAGAAGATGTACGGCATGAAGTACATGAA 
CAAGCAGCAGTGCATCGAGCGCGACGAGGTCCGCAACGTCTTCCGGGAGCTGGAGATCCT 
GCAGGAGATCGAGCACGTCTTCCTGGTGAAGGTCTGGTACTCGTTCCAGGACGAGGAGGA 
CATGTTCATGGTCGTGGACCTGCTAGTGGGGGGGGAGGTGCGGTAGCACCTGGAGCAGAA 
GGTGCAGTTCTCCGAGGACAGGGTGAGGGTGTACATGTGCGAGATGGCACTGGCTGTGGA 
CTACCTGGGCGGCCAGGACATCATCCACAGAGATGTCAAGGCTGAGAACATTCTGCTGGA 
TGAGAGAGGACATGCACACCTGACCGACTTCAACATTGCCACCATCATCAAGGACGGGGA 
GCGGGCGACGGCATTAGCAGGCACCAAGCCGTACATGGCTCCGGAGATCTTCCAXTCTTT 
TGTCAACGGGGGGACCGGCTACTCCTTCGAGGTGGACTGGTGGTCGGTGGGGGTGATGGC 
CTATGAGCTGCTGCGAGGATGGAGGCCCTATGACATCCACTCCAGCAACGCCGTGGAGTC 
GCTGGTGGAGCTGTTCAGCACGGTGAGGGTCCAGTATGTCCCCACGTGGTGCAAGGAGAT 
GGTGGGCTTGCTGCGGAAGCTCCTCACTGTGAACGGCGAGCAGCGGCTCTGCAGCCTGCA 
"'tSGACGTGCAGGCAGCCCCGGCGCTGGCCGGCGTGCTGTGGGACCACCTGAGCGAGAAGAG 
GGTGGAGCGGGGCTTCGTGCCCAACAAAGGCCGTCTGCACTGCGAGGGCAGCTTTGAGCT 
GGAGGAGATGATCCTGGAGTCCAGGCCGCTGGACAAGAAGAAGAAGGGCGTGGGCAAGAA 
CAAGTCCCGGGACAAGAGCAGGGACAGCTGCGAGTCCGAGAATGACTATCTTCAAGACTG 
CCTCGATGCCATCCAGCAAGACTTGGTGATTTTTAACAGAGAAAAGGTGAAGAGGAGCCA 
GGACGTCCCGAGGGAGCGTCTCCCCGCCGGTGAGTGCAGGGATGCTGCGGAGCCTGTGGA 
GGACGAGGCGGAACGGTCCGCCCTGCCCATGTGCGGCCCCATTTGCCCCTCGGCCGGGAG 
CGGCTAGGCGGGGATGCCCGTGGTGCTCACCCGTTGAGCTGCTTTGGAGACTCGGCTGCC 
AGAGGGAGGGGCATGGGCCGAGGCGTGGGATTCAGGTTGCGAGGCAGCGTGGGTGGCGGT 
GCCCACAGTGCCCCGGACACATTTGACACCTCAGGGTGGTGGTGGTGCAGGGGACAAGAG 
GCTGTGGGTGGAGGGGACACCTGTGGAGGGCATTTCCCGTGGGCCGCCGAGACCCGCGTA 
GATGGAGGAAGCGCTGGTGGGCGCCCTCTTACCGGTCACGGGGAGCTGGGGCCATGGATG 
GGAGAGGAGTCTTTGTCCCTGCTCAGGCCGGAGGCTGTGCACGGCCCTGGTCACAAGGTG 
ACCGTTGCAGCACAGGCCGCGGGTGGCCCAGGGTGGGGTGAGTTGTTGGAGGTGAAGGGC 
ATGGGTTGGGGTAGTGGGTGGGGAGGTGAATGTTTTCTAGAGATTCAAAGTGGTCCAGCA 
ATTTCTGTATAGTTTTCAGCTGTGAGAATTACAATGTGAGAAGCGCTC 

SEQ ID NO: 4_AA960957_H

GTCCCACATCCCGCATCCGGCATCCCAGCGGCCGGGCATGTAGCAGCGGCAGCAACGGCG 
GAATATGGGCGGGAACCACTCCCACAAGCCCCCCGTGTTTGACGAGAATGAGGAAGTCAA 
CTTTGACCATTTTCAGATTCTGCGGGCCATTGGTAAAGGGAGTTTTGGAAAGGTATGCAT 
CGTGCAGAAGCGAGACACTAAGAAAATGTATGCAATGAAGTACATGAACAAGCAGAAGTG 
CATCGAGAGGGATGAGGTTCGGAATGTTTTCCGGGAGCTGCAGATGATGCAAGGGCTGGA 






WO 00/73469
PCT/US00/14842



FIGURE 2C 

GCACCCCTTCCTGGTCAATCTGTGGTACTCCTTCCAGGATGAGGAGGACATGTTCATGGT 

GGTGGACCTGCTCCTGGGAGGCGACCTGCGCTACCATCTGCAGCAGAATGTGCATTTCAC 

AGAGGGGACTGTGAAACTCTACATCTGTGAGCTGGCACTGGCCCTGGAGTATCTTCAGAG 

GTACCACATCATCCACAGAGACATCAAGCCAGACAATATCCTGCTGGATGAACACGGACA 

TGTTCACATTACAGACTTCAACATAGCGACGGTAGTGAAAGGAGCAGAAAGGGCTTCCTC 

CATGGCTGGCACCAAGCCCTACATGGCTCCAGAAGTATTCCAGGTGTACATGGACAGAGG 

CCCCGGATACTCGTACCCTGTCGACTGGTGGTCCCTGGGCATCACAGCCTATGAGCTGCT 

GCGGGGCTGGAGGCCGTACGAAATCCACTCGGTCACGCCCATGGATGAAATCCTGAAGAT 

GTTCAAGGTGGAGCGTGTCCACTAGTCCTCCACGTGGTGCAAGGGGATGGTGGCCCTGGT 

GAGGAAGCTCCTGACCAAGGATCCTGAGAGCCGGGTGTGCAGCCTTCATGACATACAGAG 

CGTGCCCTACTTGGCCGACATGAACTGGGACGCGGTGTTCAAGAAGGCACTGATGCCGGG 

CTTTGTGCCCAATAAAGGGAGGTTGAACTGCGATCCCACATTTGAGCTTGAAGAGATGAT 

TCTAGAATCCAAGCGACTTCACAAAAAGAAGAAGCGATTGGCAAAGAACAGATCCAGGGA 

TGGCACAAAGGACAGCTGCCCGCTGAATGGACACCTGCAGCACTGTTTGGAGACTGTCCG 

GGAGGAATTCATCATATTCAACAGAGAGAAGCTCAGGAGGCAGCAGGGACAGGGCAGCCA 

GCTCTTGGACACCGACAGCCGAGGGGGAGGCCAGGCCCAAAGCAAGCTCCAGGACGGGTG 

CAACAACAACCTCCTCACCCACACCTGCACCCGTGGGTGCAGCAGCTGAGCCCACACTTG 

TTGCTGCTCAACAGGACTGCACTCGTCTCTGCCGTGCCCACCCAGAGCCCCTCTTTGTGG 

CCTGATGGTCCCTGTCTCACCCCTGAAAACATCAGATGCAGAAAAAGCCCTGGACTTGGA 

GCTGGGAAGCCTGGGTTCTGGTCCCATCTCCATGAGTGATTCACGTGTGACCTCAGAGAA 

GTCACGCCCTCTCTGTGCCTCCGTTTTCTGCATCTGCCAAAGGGGTTAAACACTTCTGGC 

CCACTTCAAATTACAAGATTATGGGGAGAAGCCAATTAGGTAGGAAACATGAAAAACCTT 

TGATATTTATAAAATCATTTTTACGTGCAAAATATAACCTTAATATTTGAAGTGACGCGC 

ATTCCCCAAAGCAATCAAACCGTCATGACTTTGCAATTTGGCACATCCTAGCTTGTTAGA 

GGGCACTTCCGAAAAACACAGCCCTGACAGCAAAATAAAGGTGTGATATGTTGGCCCCTT 

CTATGGAAACAACGCTGCCAAATCCTGGAGCAAAACCTGAAGTGTCTTCATGTGCATTGT 

CTGGCAGGCCACAGTCCTGAGCTTGTAAGATGGTGCAGGATGCAGACCAGACTTGTGCCG 

AAGGTCTCAGCGCTGCGGTCTCACTCCTCCCCTCATTTAAGAAGACTATGCTTACGTTTT 

AGTTTCAGCAGTCCTCACCACCACCATATCCCCAGTGCTGGGATGGCACACAGGTGTCCA 

TTCAGATGAGAGTTGGGTCGCTGAGCATTGGTTACTCCTGCAGAGTGTAATCAGGACGGG 

ATCCAACTGGCCCGAAAGCCCAGACCTGCAGCAGAACTCTCCAACTGTGTATCAGCTTTC 

AGGGTTTTCTCTCCTGGGAAGGGTGTAAAATCAGCTTGTCAGATTCTTCTTACAGAGAGT 

ATCCAATCGGTATTGGTGGAGCGGCTCCCTATTTATACAATAGGAAGCATGGGTGCTTAG 

AAAGTTTATTTCAGGAGGAAAATGGGTTCACACAAAAAGCAAACTACATTCTGATCTGCT 

CAGGGAGAAGCTTGCCTTTGAACTGGAAGATGTTGGGATGAGCAGGGAAAGCTTAGACTT 

TGGAGTCAGGTTTGTGTTCAGAATCCAGCCCTGCTGGCTAGTAACTAACTGGGAGACCTT 

AGGCAAAGCATGCAATCGCTCTGAATGGCAGTTTCCTCATTTTTAAACAGGGATAATAAA 

ACTAATATTGCAGGGGAGTTACAGGGTTAAATAAGATCCTGTGTGTAACCCCAAGCATTG 

GATGACTCATAGAATGGGGTTTTTTGTCAGCATAATGGTCATCATTATTTAGATAGTTTC 

TTCCTTCACTCACCCAGCAGGTCAGTTTTGTGTGCAAACAAAGGTGTTTAGGATTCTTGG 

AAATGTTGTTCCTGGGGTGTTTGATATTTGTTTGTTACATCCTGCTGAAGTTCGAGTGTG 

TTTTTATTTTTTCATCCAACTTCCATTTTTCACTTTTTACATGATTACTCAATCCTTGGG 

GCTGTCCATGTCATCTCTTAGATTTCTTAAAAGACATTTTAATGTATGGTTAGGTTTTAT 

ATTTTTATTTTTTAAAAAAGAAATAGTCAGTGTTTTCCTGCTTTGAACCGAGAGTATTTG 

TGGATTGTGTGCTCGTCGTCAGTTGACTTGTTTTGCACAGTTTTGTTTACTTGATGTCCC 

CATCAACAACCGTCCTGGTCGCCACCTCCCCCAGGAAATAAGGGGCGTGGTCCTGTGCCT 

AGTGTGAGCCTGGAGGGTCTTAAGATGATGATGGTTTTTTTTATTGGGCTGAGTTCACGA 

ATTAGGGGCAGGAGCTGGAAGTCGCCCTAGGAACACCAGATTTCCTGGTTCTGTTCAAGT 

TGGCATTTCTTGTTTGGAATAAACTATTTCTTGGACATTCCTTC 
FIGURE 2D

SEQ ID NO: 5_TBK1_H

TGGTGAGTGrGGAGGAGGGGGGGGGAGCGGGGGGGGGGTGGGGGGGGGGAGAGGGGGCTG 
GTATAACAAGAGGATTGGCTGATCGAGCCAAGATGCAGAGCACTTCTAATCATCTGTGGG 
TTTTATCTGATATTTTAGGCCAAGGAGCTACTGCAAATGTCTTTCGTGGAAGACATAAGA 
AAACTGGTGATTTATTTGCTATCAAAGTATTTAATAACATAAGCTTCCTTCGTCCAGTGG
ATGTTCAAATGAGAGAATTTGAAGTGTTGAAAAAACTCAATCACAAAAATATTGTCAAAT 
TATTTGCTATTGAAGAGGAGACAACAAGAAGACATAAAGTAGTTATTATGGAATTTTGTG 
CATGTGGGAGTTTATACACTGTTTTAGAAGAACGTTCTAATGCCTATGGACTACCAGAAT 
CTGAATTCTTAATTGTTTTGCGAGATGTGGTGGGTGGAATGAATCATCTACGAGAGAATG 
GTATAGTGCACCGTGATATCAAGCCAGGAAATATCATGCGTGTTATAGGGGAAGATGGAC 
AGTCTGTGTACAAACTCACAGATTTTGGTGCAGCTAGAGAATTAGAAGATGATGAGCAGT 
TTGTTTCTCTGTATGGCACAGAAGAATATTTGCACCCTGATATGTATGAGAGAGCAGTGC 
TAAGAAAAGATCATCAGAAGAAATATGGAGCAACAGTTGATCTTTGGAGCATTGGGGTAA 
CATTTTACCATGCAGCTACTGGATCACTGCCATTTAGACCCTTTGAAGGGCCTCGTAGGA 
ATAAAGAAGTGATGTATAAAATAATTACAGGAAAGCCTTCTGGTGCAATATCTGGAGTAC 
AGAAAGCAGAAAATGGACCAATTGACTGGAGTGGAGACATGCCTGTTTCTTGCAGTCTTT 
CTCGGGGTCTTCAGGTTCTACTTACCCCTGTTCTTGCAAACATCCTTGAAGCAGATCAGG 
AAAAGTGTTGGGGTTTTGACCAGTTTTTTGCAGAAACTAGTGATATACTTCACCGAATGG 
TAATTCATGTTTTTTCGCTACAACAAATGACAGCTCATAAGATTTATATTCATAGCTATA 
ATACTGCTACTATATTTCATGAACTGGTATATAAAGAAACCAAAATTATTTCTTCAAATC 
AAGAACTTATCTACGAAGGGCGACGGTTAGTCTTAGAACCTGGAAGGCTGGCACAACATT 
TCCCTAAAACTACTGAGGAAAACCGTATATTTGTAGTAAGCCGGGAACCTCTGAATACCA 
TAGGATTAATATATGAAAAAATTTCCCTCCCTAAAGTACATCCACGTTATGATTTAGACG 
GGGATGCTAGCATGGCTAAGGCAATAACAGGGGTTGTGTGTTATGCCTGCAGAATTGCCA 
GTACCTTACTGCTTTATCAGGAATTAATGCGAAAGGGGATACGATGGCTGATTGAATTAA 
TTAAAGATGATTACAATGAAACTGTTCACAAAAAGACAGAAGTTGTGATCACATTGGATT
TCTGTATCAGAAACATTGAAAAAACTGTGAAAGTATATGAAAAGTTGATGAAGATCAACC
TGGAAGCGGCAGAGTTAGGTGAAATTTCAGACATACACACCAAATTGTTGAGACTTTCCA
GTTCTCAGGGAACAATAGAAACCAGTCTTCAGGATATCGACAGCAGATTATCTCCAGGTG
GATCACTGGCAGACGCATGGGCACATCAAGAAGGCACTCATCCGAAAGACAGAAATGTAG 
AAAAACTACAAGTCCTGTTAAATTGCATGACAGAGATTTACTATCAGTTCAAAAAAGACA 
AAGCAGAACGTAGATTAGCTTATAATGAAGAACAAATCCACAAATTTGATAAGCAAAAAC 
TGTATTACCATGCCACAAAAGCTATGACGCACTTTACAGATGAATGTGTTAAAAAGTATG 
AGGCATTTTTGAATAAGTCAGAAGAATGGATAAGAAAGATGCTTCATCTTAGGAAACAGT 
TATTATCGCTGACTAATCAGTGTTTTGATATTGAAGAAGAAGTATCAAAATATCAAGAAT 
ATACTAATGAGTTACAAGAAACTCTGCCTCAGAAAATGTTTACAGCTTCCAGTGGAATCA 
AACATACCATGACCCCAATTTATCCAAGTTCTAACACATTAGTAGAAATGACTCTTGGTA 
TGAAGAAATTAAAGGAAGAGATGGAAGGGGTGGTTAAAGAACTTGCTGAAAATAACCACA 
TTTTAGAAAGGTTTGGCTCTTTAACCATGGATGGTGGCCTTCGCAACGTTGACTGTCTTT 
AGCTTTCTAATAGAAGTTTAAGAAAAGTTTCCGTTTGCACAAGAAAATAACGCTTGGGCA 
TTAAATGAATGCCTTTATAGATAGTCACTTGTTTCTACAATCCAGTATTTGATGTGGTCG 
TGTAAATATGTACAATATTGTAAATACATAAAAAATATACAAATTTTTGGCTGCTGTGAA 
GATGTAATTTTATCTTTTAACATTTATAATTATATGAGGAAATTTGACCTCAGTGATCAC 
GAGAAGAAAGCCATGACCGACCAATATGTTGACATACTGATCCTCTACTCTGAGTGGGGC 
TAAATAAGTTATTTTCTCTGACCGCCTACTGGAAATATTTTTAAGTGGAACCAAAATAGG 
CATCCTTACAAATCAGGAAGACTGACTTGACACGTTTGTAAATGGTAGAACGGTGGCTAG 
TGTGAGTGGGGAGCAGAACCGCACCACTGTTATACTGGGATAACAATTTTTTTGAGAAGG 
ATAAAGTGGCATTATTTTATTTTACAAGGTGCCCAGATCCCAGTTATCCTTGTATCCATG 
TAATTTCAGATGAATTATTAAGCAAACATTTTAAAGTGAATTCATTATTAAAAACTATTC 
ATTTTTTTCCTTTGGCCATAAATGTGTAATTGTCATTAAAATTCTAAGGTCATTTCAACT 
GTTTTAAGCTGTATATTTCTTTAATTCTGCTTACTATTTCATGGAAAlAAAATAAATTTC
CAATTTTAAAAAA

SEQ ID NO: 6_AA305176_H

TGGCTGCTCGCGGAGGGGCAGTGTACGCGGGGCCGCTGTAGGCTGTCCAGCGATGGATCC 
CACCGCGGGAAGCAAGAAGGAGCCTGGAGGAGGCGCGGCGACTGAGGAGGGCGTGAATAG 
GATGGGAGTGCCAAAACCGCGGTCCATTGAGGAATTCAGCATAGTGAAGCCCATTAGCCG 
GGGCGCCTTCGGGAAAGTGTATCTGGGGCAGAAAGGCGGCAAATTGTATGCAGTAAAGGT 
TGTTAAAAAAGCAGACATGATCAACAAAAATATGACTCATCAGGTCCAAGCTGAGAGAGA 
TGCACTGGCACTAAGCAAAAGCCCATTCATTGTCCATTTGTATTATTCACTGGAGTCTGC 

aaacaatgtctacttggtaatggaatatcttattgggggagatgtcaagtctctcctaca 
tatatatggttattttgatgaagagatggctgtgaaatatatttctgaagtagcactggc 
tctagactaccttcacagacatggaatcatccacagggacttgaaaccggacaatatgct 
tatttctaatgagggtcatattaaactgacggattttggcctttcaaaagttactttgaa 
tagagatattaatatgatggatatccttacaacaccatcaatggcaaaacctagacaaga 
ttattcaagaaccccaggacaagtgttatcgcttatcagctcgttgggatttaacacacc 
aattgcagaaaaaaatcaagagcctgcaaacatcctttcagcctgtctgtctgaaacatc 
acagctttctcaaggactcgtatgccctatgtctgtagatcaaaaggacactacgcctta 
ttctagcaaattactaaaatcatgtcttgaaacagttgcctccaacccaggaatgcctgt 
gaagtgtctaacttctaatttactccagtctaggaaaaggctgggcacatcgagtgccag 
tagtcaatcccacaccttcatatccagtgtggaatcagaatgccacagcagtcccaaatg 
ggaaaaagattgccaggtttgagggacatttatgttaatgaaaatcaattatgtatgtca 
aatgaatgtgagaaatattataccttttcatataaattccataaagaaatgaaattgtta 
catgaatggcagtcatagtattaatcagaaattcattttcctgcacattctgtcaaattc 
ttttgaaatatttcatttctcattcaattgtgacattgttcttacttgattatataatga 
gattcttgcagtaaattgataataaatgcttggcttgtgtgtatgtaggtggacctcact 
tgtttttagaagtccttcccatgatacagacattggcttgttggttttgttttattttgt 
ttttaagatatgtcatttaaaaactcatattacctcctttt 

SEQ ID NO: 7_AA116841_M

ccacgcgtccgatcccatggccagaaggcgaagaaaagctatctgataatgctcaaagtg 
caatggacatgcttttaaccattgatgattgaaagagagctggaatgagagaactaaaac 
agcatcctctcttcagtgaagtggactgggaaaatctgcagcatcagactatgcctttcg 
taccccaaccagacgacgaaac^gatacatcctattttgaagccagajLataatgctcaac 
atctgaccgtatctgggtttagtctgtagcacatgcgtgtcatttttatctaacttgtga 
tatagaattaagttttacagtaatatgctacttaatactagattggtctaaatgggataa 
aagtcattattttacccagactgaacagcttttaattactaagtacaacagtttttacag 
aattaaaatactataagcaatataatcagtaattaatctttaccttagaactgtatataa 

GCCATAATAGCTTTTTTCATCTTATTTATTCACTGCACTTTATGAAGAGCAAAGTATCAA 
TAAACTAAAACACTACCACTCTAAATAGAGGGAGTGAGCCGT 

SEQ ID NO: 8_AA256100_H

AGGGAGCTGACGGGCGCCCGGCCGGCTGCGGTCCGTGCGGAGGCTGAGCCGGCCGCGGGC 
GCGACCGGAGGCAGTTTCCGTTACTATGGCAATGACGGCAGGGACTACAACAACCTTTCC 
TATGAGCAACCATACCCGGGAAAGAGTGACTGTAGCCAAGCTCACATTGGAGAATTTTTA 
TAGCAACCTAATTTTACAGCATGAAGAGAGAGAAACCAGGCAGAAGAAATTAGAAGTGGC 
CATGGAAGAAGAAGGATTAGCAGATGAAGAGAAAAAGTTACGTCGATCACAACACGCTCG 
CAAAGAAACAGAGTTCTTACGGCTCAAAAGGACCAGACTTGGCTTGGATGACTTTGAGTC 
TCTGAAAGTTATAGGAAGAGGAGCTTTTGGAGAGGTGCGGTTGGTCCAGAAGAAAGATAC 
AGGCCATATCTATGCAATGAAGATATTGAGAAAGTCTGATATGCTTGAAAAAGAGCAGGT 
GATGTTTTACAGTTTTCAGGATAAGAGGAATCTTTATCTAATCATGGAATTTCTCCCTGG
AGGTGACATGATGACATTGCTAATGAAGAAAGACACCTTGACAGAAGAGGAAACACAGTT
TTACATTTCAGAGACTGTTCTGGCAATAGATGCGATCCACCAGTTGGGTTTCATCCATCG
GGATATTAAGGCAGACAAGGTTTTATTGGATGCGAAGGGTCATGTAAAATTATCTGATTT
TGGTTTATGTACGGGATTAAAGAAAGCTCACAGGACTGAATTTTATAGAAATCTCACACA 
CAACCGACCAAGTGACTTCTCATTTCAGAACATGAACTCAAAGAGGAAAGCAGAAAGTTG 
GAAGAAGAACAGGAGACAACTGGCATATTCCACAGTTGGGAGACCAGATTACATTGGTCC 
AGAAGTATTCATGCAGACTGGTTACAACAAATTGTGTGACTGGTGGTCTTTGGGAGTGAT 
TATGTATGAAATGCTAATAGGATATCCACCTTTCTGCTCTGAAACACCTCAAGAAACATA 
CAGAAAAGTGATGAACTGGAAAGAAACTCTGGTATTTCCTCCAGAGGTACCTATATCTGA 
GAAAGCCAAGGACTTAATTCTCAGATTTTGTATTGATTCTGAAAACAGAATTGGAAATAG 
TGGAGTAGAAGAAATAAAAGGTCATCCCTTTTTTGAAGGTGTCGACTGGGAGCACATAAG 
GGAAAGGCCAGCAGCAATCCCTATAGAAATCAAAAGCATTGATGATACTTCAAATTTTGA 
TGACTTCCCTGAATCTGATATTTTACAACCAGTGCCAAATACCACAGAACCGGACTACAA 
ATCCAAAGACTGGGTTTTTCTCAATTATACCTATAAAAGGTTTGAAGGGTTGACTCAACG 
TGGCTCTATCCCCACCTACATGAAAGCTGGGAAGTTATGAATGAAGATAACATTCACCCA 
TAACCAAGAGAACTCAGGTAGCTGCATCACCAGGCTTGCTTGGCGTAGATAACAATACAC 
TGAAATACTCCTGAAGATGGTGGTGCTTATTGACTACAAGAGGAAATTCTACAGGATTAG 
GATTTCTAAGACTACTATAGGAATTGGTTGGCAGTGCCAGCTGGCTCTTTTTTTTAATAT 
TTTATTATTTTTGTTAACTTTATTATATGAAGGTACTGGAATAAAAGGAACAGACATCCC
TTTCTAACTGCACTGCGTACATGCGTATTAAGGTCCATTCTGCCTGTGTGTGCTGTGGCT 
TTGAACTGTAACACCTCTAATCAATTCAGGAGAAACACATATCATTTAAAGCAACATAGG 
CTAACCTGTAGGTAACACTGCAGTATTGATGTTTTACTGCAAATCTTATGGGTCTAGATA 
ATCAGTAAAAGCCATCTTCCATAGTTGGTGTTAGAACATTGCCCTATTGGTTTGGACATC 
TGTAGAATATATATGAAGACAATTTCTGTAATGGTTTTAAGAGATTTAAAAAGAAATTCA 
u-iCTGGlTTCTTTACAAAATAGAATTTATCATCAAGTTATTACACAAACTTCACAGTAAGGAG 
TGACAAGTTTATAATAAGGAAGACAAAGTTTAACACCTTCACTCAAGCACTCCACTAATA 
TATTTACGTTGCATTCAGAAATACTGATGACCTTCATATACGTAGTCTGTATACTCATAG 
GGAGATGTACTGTATTATATAACATGTAAAGTTGATTTTCTTGTGACAAGAGAACTTCTT 
TTTTTAACAAGAGGACATGGCATTATTTTAATTTGATTATGGTGAGTTGAATTTAAGACA 
TGACCATGAAGGCTGCTTGTAGAATTAGTGTATTTTTATTAAACTATTTTTTTAAATGTC 
AAACTTCTATCATGTAAATGGACTTATAGAGAACAAAAAGCTATTTACTTTGGTTTTCTA 
GAAAGTTGTTACATATCATGGCTGGTTAACTTTTATTTCTTTTGATGAAAATTTTTCCTT 
TGATAGTACTTGTATTATTGTGCCATTATTTTCTTATGCTCCAAATGTACCAAAGATCTT 
GAACAGAGTGGATGTTCACAACTGAGTAGAATTTTCCTTTCCTGTGGGCATGCTGTATTC 
AGACCTGACAGATCTTTGATAGAGGTCAGCTTATTAAAGGGCAATATTGTTCTTGTTTAG 
CTACATCACTGTGGTGAATATAGATGGAATTAAGGAAGTAAATGCAGGCCAGGGGGTTGT 
GATGAGAGGATAGGGGAGATAATATCAGCATCAAATTCTTTGGGTATCTCTCTAAGAATT 
AAATAATCTTTTCTAGCTTAATATTTTAATTCTAATTCAAACAACTCTGAGGTTTTGGTT 
TCATTAGTAATAGTTGAGGAATAATATACTAGCAAAGAATGGCCTAATGTTTGTCATAAG 
TGTTAATGGATGAAATTTTTTAAAGATAGAAGCATGATAAGCATTATAAATGATCTATGA 
TCAAAATCTAAAGTGATGAATTATTTGTAGGAATGTCTTCCTAATGGGGAAGAATTGCAT 
AGGAGCATTATGCAAATCTACACAAGCTTTTATAAATGTTGCTGCTGGGTAGCTCCAGAG 
TGTTTCATAAGGCCATCCTGTTTCCCCCAACTCCCCCATTTTTGGTTTGTTTCTTTTTAA 
ATATTTGTTGAGTACTTACGTGTTTATCTAACAGTTCACTTCCATTTTTCTAGTCTGGAT 
TTTTTGAGTATTTAGGAAAGAGAGCTATTAAAAACTCTGGGGATTTCTCAATGTGACTAA 
GTGTAATTTTTCTAATTATAACTGCCTTTAATTAACATAATATTAACTTTTGCTGAGGTT 
TATGAGATTTTCTCACCCCACATCGCTCCCCTTTTTTTAAAAAGGACTGTTTTGCTAGTG 
TGATAATGAATAGGTAAGATATGAGATAATTGCAACATTGTCCTAGTTCTAGTATGGTAA
CTATTCTTGAAATGGTATTGAAAAATACCGTTAATTCAAATTGACAGAGATTGATAAAAA

GAAACTGATTTACCTAAGTTTAGTTTTTAATTGCATAATAGAGCATTTTTTGTTTTGAGT 

TCCCTCATTCTTATTACCAGAAAGAGCTTGCAAATAGTTTTACTTTCTTGGCACTGGAAG 

GGTAGTTCTGGAAAGCTACTTTGTTGAGAGTCTCATTCTTCCCTGGAGTTAATAGAGTGA 

TTCACAATCTTTGGGGTTTTCTCCTCATCAAAAGCATTTCTTAAGTGCCTATCTAAAAGC 

AATTAAAGACTGTGTCTGCGCTTTAGAAGCTAAGAATTTGATTGATGATGGAAATTAACT 

AGATAATTTGCAAAGTACCCTTGAGATTGAATTTTCTCTATTATATATTTGGCATATTTC 

AGGTGAATAATTTAATTTAAATGACAAAACGCTATGTAGTCAAGTGGGCATAATGACATT 

TTCTTTAAATTAGACTCTATTTTGAATTAAAAGAGTTTTATTATAAACCGTGTGTTTTTG 

GTTTTTCTAAGTATATAGAAAGCTTGTATAATTCAGATTTATGAATTTCCTGATTTAATG 

TAGACTTTGACTTTTTTATTAAAAACCTTTGTATTAAAGCAAGTTATGTTATTTTTCTTT 

TATGCATTTATTACTAACATAGCTTTAAATCTTTAAATGTATTGAAGCATTGTGCTGTCT 

GAAAATAAGGAATTGCTTATAAACCAGCCACTTCTGAATACAATATGTAGCTGATTTAAT 

AAGCTAGTTAGTGAATGGAAAATAAGTGTGGAGTATTAAAAATGTTCTTTGGTTGGTAAG 

GCCT/^AGATAGGGTTTCATTTATTTCTATACTTTTTCTGTTTTTTAAACACCTGCATATT 

TTTATGTAAATCTCTAAATTTAAAATATTTTAAGTACATTTATTTTTGGTGTTTTATTGT 

ATAAAACCTTAGACAATCAATCAGTCAGTCTTTACTGACAGGAGCAGCAGCTATCTGTCT 

TTTGCTGATCTACAAATAAATGAATTGAGAATTTAGTCCATAGAGGTCCCTGGCTACCAA 

ACACATTCTCCTTTGAATTGTTAAAATTCAGAACATTCAAAATAACTGTTTTGCTACAAG 

CCATGATTATTTTCCTGTTGTGTTTATTTAAATTTACTTTCTCTTTAGAAGTGGACTTAT 

TTCTGAAAAATCTTAATGAAACAAACGCTTAGAACAAATATAAATATGAGACACTTGGGA 

CTACTAGAGATATTTTAGATTTTTATGAAAAAAATGTGAGGGGATATTGCTGGTTTAAAA 

AGGAATAAAGTAATAAAAATATATCTCAGCTATTTTTTTAAAGCAATATAATTGAGCAAT 

TGTCTAGAAAAGTAATCATGAGGCTACTGAGTTTGGTGTTCAGTTACTGAGTTTCAAAAA 

TGTTTTGGTGGCATGAGGACAAAATTTCATTGAAGGTAAGATAAGAATAAAAACTATGTT 

TAC 

SEQ ID NO: 9_AA210825_H 

CACGAGGGCTACTGGCGCCTGGCGACCCTCCCTGCCGCCCACCCAACCCCGCTCCGGCAA 
CGCCCCCTTCCTCACGGCTCCCGAGCGAACTTTTCTCCAACTTCTGCGACTCGTGAGATT 
CCCTTCTACCCACTCCGGCCCTCGGGACCCCTCTGCCCATCCCCTGGCCGGTCGGGTCCC 
TGCGAACCCCTTTATCTCTGGAATCCACTCGGTCCCCGACTCAGAGACTCCTGCCCTCCA 
CCCCCAAGGACCCCGCCATCCTCAGGTCCCCTCCGCCTGCCAGATCTTTTCTCGGATCCC 
CGCTCTCCCACCACCTGCTCACGAGATCCCGCGGATCTAGAACCCAGGGTCCCCCGGGGC 
CCCCCGGCGGGTCCCGGGTGGGCTCCAGGCGGGCGGTCCCCGGCCTCCCCCCATGGCCAC 
CGCCCCCTCATTATCCCGCCGGGCTCCCTGGCTCTCCCGGGCCGGGGTCTCCTCGGGCCC 
CCGGCGGCCTAGAGCTGCAGTCGCCGCCACCGCTACTGCCCCAGATCCCGGGCCGGGGTT 
CCGGGGTCTCCTTTCACATCCAGATCGGGCTGACCCGCGAGTTCGTGCTGTTGCGGGGCG 
CCTCCGAGCTGGCTCATGTGAAGCAGCTGGCCTGTTCCATGGTGGAGCAGAAGTTGCCTG 
AGTGTGGCTTCTACGGCCTTTACGACAAGATCCTGCTTTTCAAACATGACCGCACGTCGG 
CCAACCTCCTGCAGCTGGTGCGCTCGTCCGGAGACATCCAGGAGGGCGACCTGGTGGAGG 
TGGTGCTGTCGGCCTCGGCCACGTTCGAGGAGTTCCAGATGGGCCCGCAGGCCGTCACGG 
TGCACTCCTATCGGGCGCCTGCCTTCTGTGATCACTGCGGGGAGATGCTCTTCGGCCTAG 
TGCGCCAGGGCCTCAAGTGCGATGGCTGCGGGGTGAAGTAGGACAAGGGCTGTGCCTTCA 
GCATCCCCAACAACTGTAGTGGGGCCCGCAAACGGCGCCTGTCATCCACGTCTCTGGCCA 
GTGGCCACTCGGTGCGCCTCGGCACCTGCGAGTCGGTGGCGTGGAGGGCTGAAGAGCTGA 
GCCGTAGCACCACCGAACTGCTGCGTCGGCGTGGGCCGTGATCCTCTTGCTGCTGTTCTG 
GCTCATCGTATACGGGGCGGCCCATTGAGCTGGACAAGATGCTGCTGTCGAAGGTCAAGG 
TGCCGCACACCTTCCTCATCCACAGCTATACACGGCCCACCGTTTGCCAGGCTTGCAAGA 
AACTCCTCAAGGGGCTCTTCGGGCAGGGCCTGCAATGCAAAGACTGGAAGTTTAACTGTC 







FIGURE 2H 

ACAAACGCTGCGCCACCCGCGTrCCTAATGACTGCCTGGGGGAGGCCCTTATCAATGGAG 
ATGTGCCGATGGAGGAGGCCACGGATTTCAGCGAGGCTGACAAGAGCGCCCTCATGGATG 
AGTCAGAGGACTGCGGTGTCATCCCTGGCTCCCACTCAGAGAATGCGCTCCACGCCAGTG 
AGGAGGAGGAAGGCGAGGGAGGCAAGGCCCAGAGCTCCCTGGGGTACATCCCCCTAATGA 
GGGTGGTGCAATCGGTGGGAGAGAGGACGGGGAAATCCAGGAGCACGGTGCGGGAGGGTT 
GGGTGGTTCATTACAGCAACAAGGACACGCTGAGAAAGCGGCACTATTGGCGCCTGGACT 
GCAAGTGTATCACGCTCTTCCAGAACAACACGACCAACAGATACTATAAGGAAATTCCGC 
TGTCAGAAATCCTCACGGTGGAGTGCGCCGAGAACTTCAGGGTTGTGCCGCCGGGGAGGA 
ACCCACACTGCTTTGAGATCGTCACTGCCAATGCCACCTACTTCGTGGGCGAGATGCCTG 
GCGGGACTCCGGGTGGGCCAAGTGGGCAGGGGGCTGAGGCCGCCCGGGGGCTGGNNGAGA 
CAGCCATCCGCCAGGCCCTGATGCCCGTCATCCTTCAGGACGCACCCAGCGCCCCAGGCC 
ACGCGCCCCACAGACAAGCTTCTCTGAGCATCTCTGTGTCCAACAGTCAGATCCAAGAGA 
ATGTGGACATTGCCACTGTCTACCAGATCTTCCCTGACGAAGTGCTGGGCTCAGGGCAGT 
TTGGAGTGGTCTATGGAGGAAAACACCGGAAGACAGGCCGGGACGTGGCAGTTAAGGTCA 
TTGACAAACTGCGCTTCCCTACCAAGCAGGAGAGCCAGCTCCGGAATGAAGTGGCCATTC 
TGCAGAGCCTGCGGCATCCCGGGATCGTGAACCTGGAGTGCATGTTCGAGACGCCTGAGA 
AAGTGTTTGTGGTGATGGAGAAGCTGCATGGGGACATGTTGGAGATGATCCTGTCCAGTG 
AGAAGGGCCGGGTGCCTGAGCGCCTCACCAAGTTCCTCATCACCCAGATCCTGGTGGCTT 
TGAGACACCTTCACTTCAAGAACATTGTCCACTGTGACTTGAAACCAGAAAACGTGTTGC 
TGGCATCAGCAGACCCATTTCCTCAGGTGAAGCTGTGTGACTTTGGCTTTGCTCGCATCA 
TCGGCGAGAAGTCGTTCCGCCGGTCAGTGGTGGGCACGCCGGCCTACCTGGCACCCGAGG 
TGCTGCTCAACCAGGGCTACAACCGCTCGCTGGACATGTGGTCAGTGGGCGTGATCATGT 
ACGTCAGCCTCAGCGGCACCTTCCCTTTCAACGAGGATGAGGACATCAATGACCAGATCC 
AGAACGCCGCCTTCATGTACCCGGCCAGCCCCTGGAGCCACATCTCAGCTGGAGCCATTG 
AGCTCATCAACAACCTGCTGCAGGTGAAGATGCGCAAACGCTACAGCGTGGACAAATCTC 
TCAGCCACCCCTGGTTACAGGAGTACGAGACGTGGCTGGACCTCCGAGAGCTGGAGGGGA 
AGATGGGAGAGGGATACATCAGGCATGAGAGTGACGAGGCGCGCTGGGAGCAGTTTGCAG 
CAGAGCATCCGCTGCCTGGGTCTGGGCTGCCCACGGACAGGGATCTCGGTGGGGCCTGTC
CACCACAGGACCACGACATGCAGGGGCTGGCGGAGCGCATCAGTGTTCTCTGAGGTCCTG 
TGCCCTCGTCCAGCTGCTGCCCTCCACAGCGGTTCTTCACAGGATCCCAGCAATGAACTG 
TTCTAGGGAAAGTGGCTTCCTGCCCAAACTGGATGGGACACGTGGGGAGTGGGGTGGGGG 
GAGCTATTTCCAAGGCCCCTCCCTGTTTCCCCAGCAATTAAAACGGACTCATCTCTGGCC 
CCATGGCCTTGATCTCAGCAAAA 

SEQ ID NO: 10_AA127299_H 

ATTCAATTCATAATTGTTGGTGCAAAAGATTTGCTTGCTATGGATTCAAATGGTCTTTCT 
GATCCTTACATCAAAATCACAAATCTTTCTCAAAAAACGAAAGTGATTAAGAAAACTTTG 
ACTCCAACTTGGAATGAAACTTTTTTTGTGCATTTTCCAGAAAAAACAACCCTTGAATTA 
GAATGTTGGGACCACGATACTTTTTCAGATGATTTTATTGGCAAGGCTTCCATTTCTTTG 
GCAGAGATTCCAGCTTTGGCAGAAGTTGATATGTGGATAGATATGAAAACGAAAAAAGGA 
GAATTTGCAGGAAAA 

SEQ ID NO: 11_AA316804_H

ATGTCTGCAAATAATTCGCCTCCATCAGGGCAGAAGTGTGTATTACCCACAGCTATTCCT 

GCTGTGCTTCCAGCTGCTTCTCCGTGTTCAAGTCCTAAGACGGGACTCTCTGCCCGACTC 
TGTAATGGAAGGTTCAGTGCAGCATGAGTGAGGAAGTGGAGAGGCTGAGTGGATAGAGTT 
TCATTTCTACTGCAAATTGGGGTGAGACGGGAGAGTGTTACCATTGAAGCGCAGGAACTG 
TCTTTATCTGCTGTCAAGGATCTTGTGTGCTCCATAGTTTATCAAAAGTTTCCAGAGTGT 
GGATTCTTTGGCATGTATGACAAAATTCTTCTCTTTCGCCATGACATGAACTCAGAAAAC
ATTTTGCAGCTGATTACCTCAGCAGATGAAATACATGAAGGAGACGTAGTGGAAGTGGTT 
CTTTCAGCTTTAGCCACAGTAGAAGACTTCCAGATTCGTCCACATACTCTCTATGTACAT

TCTTACAAAGCTCCTACTTTCTGTGATTACTGTGGTGAGATGCTGTGGGGATTGGTACGT 

CAAGGACTGAAATGTGAAGGCTGTGGATTAAATTACCATAAACGATGTGCCTTCAAGATT 

CCAAATAACTGTAGTGGAGTAAGAAAGAGACGTCTGTCAAATGTATCTTTACCAGGACCC 

GGCCTCTCAGTTCCAAGACCCCTACAGGCTGAATATGTAGCCCTTCCCAGTGAAGAGTCA 

CATGTCCACCAGGAACCAAGTAAGAGAATTCCTTCTTGGAGTGGTCGCCCAATCTGGATG 

GAAAAGATGGTAATGTGCAGAGTGAAAGTTCCACACACATTTGCTGTTCACTCTTACACC 

CGTCCCACGATATGTCAGTACTGCAAGCGGTTACTGAAAGGCGTCTTTCGCCAAGGAATG 

GAGTGTAAAGATTGCAAATTCAACTGGCATAAACGCTGTGCATCAAAAGTACCAAGAGAC 

TGCCTTGGAGAGGTTACTTTCAATGGAGAACCTTCCAGTCTGGGAACAGATACAGATATA 

CCAATGGATATTGACAATAATGACATAAATAGTGATAGTAGTCGGGGTTTGGATGACACA 

GAAGAGCCATCACCCCCAGAAGATAAGATGTTCTTCTTGGATCCATCTGATCTCGATGTG 

GAAAGAGATGAAGAAGCCGTTAAAACAATCAGTCCATCAACAAGGAATAATATTCCGCTA 

ATGAGGGTTGTACAATCCATCAAGCACACAAAGAGGAAGAGCAGCACAATGGTGAAGGAA 

GGGTGGATGGTCCATTACACCAGCAGGGATAACCTGAGAAAGAGGCATTATTGGAGACTT 

GAC^GCAAATGTCTAACATTATTTCAGAATGAATCTGGATCAAAGTATTATAAGGAAATT 

CCACTTTCAGAAATTCTCCGCATATCTTCACCACGAGATTTCACAAACATTTCACAAGGC 

AGCAATCCACACTGTTTTGAAATCATTACTGATAGTATGGTATACTTCGTTGGTGAGAAC 

AATGGGGACAGCTCTCATAATCCTGTTCTTGCTGCCACTGGAGTTGGACTTGATGTAGGA 

CAGAGCTGGGAAAAAGCAATTCGCCAAGCCCTCATGCCTGTTACTCCTCAAGCAAGTGTT 

TGCACTTCTCCAGGGCAAGGGAAAGATCACAAAGATTTGTCTACAAGTATCTCTGTATCT 

AATTGTCAGATTCAGGAGAATGTGGATATCAGTACTGTTTACCAGATCTTTGCAGATGAG 

GTGCTTGGTTCAGGCCAGTTTGGCATCGTTTATGGAGGAAAACATAGAAAGACTGGGAGG

GATGTGGCTATTAAAGTAATTGATAAGATGAGATTCCCCACAAAACAAGAAAGTCAACTC 

GGTAATGAAGTGGCTATTTTACAGAATTTGCACCATCCTGGGATTGTAAACCTGGAATGT 

ATGTTTGAAACCCCAGAACGAGTCTTTGTAGTAATGGAAAAGCTGCATGGAGATATGTTG 

GAAATGATTGTATCCAGTGAGAAAAGTCGGCTTCCAGAACGAATTACTAAATTCATGGTC 

ACACAGATACTTGTTGCTTTGAGGAATCTGCATTTTAAGAATATTGTGCACTGTGATTTA 

AAGCCAGAAAATGTGCTGCTTGCATCAGCAGAGCCATTTCCTCAGGTGAAGCTGTGTGAC 

TTTGGATTTGCACGCATCATTGGTGAAAAGTCATTCAGGAGATCTGTGGTAGGAAGTCCA 

GCATACTTAGCCCCTGAAGTTCTCCGGAGCAAAGGTTACAACCGTTCCCTAGATATGTGG 

TCAGTGGGAGTTATCATCTATGTGAGCCTCAGTGGCACATTTCCTTTTAATGAGGATGAA 

GATATAAATGACCAAATCCAAAATGCTGCATTTATGTAGCCACCAAATCCATGGAGAGAA 

ATTTCTGGTGAAGCAATTGATCTGATAAACAATCTGCTTCAAGTGAAGATGAGAAAACGT 

TACAGTGTTGACAAATCTCTTAGTCATCCCTGGCTACAGGACTATCAGACTTGGCTTGAG

CTTAGAGAATTTGAAACTCGCATTGGAGAACGTTACATTACACATGAAAGTGATGATGCT 

CGCTGGGAAATACATGCATACACACATAACGTTGTATAGGCAAAGCACTTCATTATGGGT 

CCTAATCCAGATGATATGGAAGAAGATCCTTAA 

SEQ ID NO: 12_PKNBETA_H 

ATGGAGGAGGGGGCGGGGCGGCAGCCTGGGCCGAGCCAGTGGCCCCCAGAGGATGAGAAG 
GAGGTGATCCGCCGGGCCATCCAGAAAGAGCTGAAGATCAAGGAGGGGGTGGAGAACCTG 
CGGCGCGTGGCCACAGACCGCCGCCACTTGGGCCATGTGCAGCAGCTGCTGCGGTCCTCC 
AACCGCCGCCTGGAGCAGCTGCATGGCGAGCTGCGGGAGCTGCACGCCCGAATCCTGCTG 
CCCGGCCCTGGGCCTGGCCCAGCTGAGCCTGTGGCCTCAGGACCCCGGCCGTGGGCAGAG 
CAGCTCAGGGCTCGGCACCTAGAGGCTCTCCGGAGGCAGCTGCATGTGGAGCTGAAGGTG 
AAACAGGGGGCTGAGAACATGACCCACACGTGCGCCAGTGGCACCCCCAAGGAGAGGAAG 
CTCCTTGCAGCTGCCCAGCAGATGCTGCGGGACAGCCAGCTGAAGGTGGCCCTGCTGCGG 
ATGAAGATCAGGAGCCTGGAGGCCAGTGGGTGCGCGGAGCCAGGGGGTGAGCTACTGGCG 
GAGGAGCTACAGCATCGACTGCACGTTGAGGCAGCGGTGGCTGAGGGCGCCAAGAACGTG 
GTGAAACTGCTTAGTAGCCG-jAGAACACAGGACCGCAAGGCACTGGCTGAGGCCCAGGCC
GAGCTACAGGAGTCCTCTCAGAAACTGGACCTCCTGCGCCTGGCCTTGGAGCAGCTGCTG
GAGCAACTGCCTCCTGCCCACCCTTTGCGCAGCAGAGTGACCCGAGAGTTGCGGGCTGCG 
GTGCCTGGATACCCCCAGCCTTCAGGGACACCTGTGAAGCCCACCGCCCTAACAGGGACA
CTGCAGGTCCGCCTCCTGGGCTGTGAACAGTTGCTGACAGCCGTGCCTGGGCGCTCCCCA 
GCGGGGGCACTGGCCAGCAGCCCCTCCGAGGGCTGGCTTCGGACCAAGGCCAAGCACCAG 
CGTGGCGGAGGGGAGCTTGCGAGTGAGGTGCTGGCTGTGCTAAAGGTGGACAACCGTGTT 
GTGGGGCAGACGGGCTGGGGGCAGGTGGCCGAACAGTCCTGGGACCAGACCTTTGTCATC
GCACTGGAGCGAGCCCGTGAGCTGGAGATTGGGGTACACTGGCGGGACTGGCGGCAGCTA 
TGTGGCGTGGGCTTCGTGAGACTTGAAGACTTCCTGGACAATGCCTGTCACCAACTGTCC 
CTCAGCCTGGTACCGCAGGGACTGCTTTTTGCCCAGGTGACCTTCTGCGATCCTGTCATT 
GAGAGGCGGCCCCGGCTGCAGAGGCAGGAACGCATCTTCTCTAAACGCAGAGGCCAGGAC 
TTCCTGAGGCGTTCGCAGATGAACCTCGGCATGGCGGCCTGGGGGCGCCTCGTCATGAAC 
CTGCTGCCCCCCTGCAGCTCCCCGAGCACAATCAGCCCCCCTAAAGGATGCCCTCGGACC 
CCAACAACACTGCGAGAGGCCTCTGACCCTGCCACTCCCAGTAATTTCCTGCCCAAGAAG 
ACCCCCTTGGGTGAAGAGATGACACCCCCACCCAAGCCCCCACGCCTCTACCTCCCCCAG 
GAGCCAACATCCGAGGAGACTCCGCGCACCAAACGTCCCCATATGGAGCCTAGGACTCGA 
CGTGGGCCATCTCCACCAGCCTCCCCCACCAGGAAACCCCCTCGGCTTCAGGACTTCCGC 
TGCTTAGCTGTGCTGGGCCGGGGACACTTTGGGAAGGTCCTCCTGGTCCAGTTCAAGGGG 
ACAGGGAAATACTACGCCATCAAAGCACTGAAGAAGCAGGAGGTGCTCAGCCGGGACGAG
ATAGAGAGCCTGTACTGCGAGAAGCGGATCCTGGAGGCTGTGGGCTGCACAGGGCACCCT 
TTCCTGCTCTCCCTCCTTGTCTGCTTCCAGACCTCCAGCCATGCCCGCTTTGTGACTGAG 
TTTGTGCCTGGTGGTGACCTCATGATGCAGATCCACGAGGATGTCTTCCCCGAGCCCCAG 
GCCCGCTTCTACGTGGCTTGTGTTGTCCTGGGGCTGCAGTTCTTACACGAGAAGAAGATC 
ATTTACAGGGACCTGAAGTTGGATAACCTTCTGCTGGATGCCCAGGGATTCCTGAAGATC 
GCAGACTTTGGACTCTGCAAGGAAGGGATCGGCTTCGGGGACCGGACTAGCACCTTCTGT 
GGCACCCCGGAGTTCCTGGCTCCCGAGGTGCTGACCCAGGAGGCATACACACAGGCCGTC 
GACTGGTGGGCGCTGGGTGTGCTGCTCTACGAGATGCTGGTGGGTGAGTGCCCGTTCCCA
GGGGACACAGAGGAAGAGGTGTTTGACTGCATCGTCAACATGGACGCCCCCTACCCCGGC 
TTTCTGTCGGTGCAAGGGCTTGAGTTCATTCAGAAGCTCCTCCAGAAGTGCCCGGAGAAG 
CGCCTCGGGGCAGGTGAGCAGGATGCCGAGGAGATCAAGGTCCAGCCATTCTTCAGGACC 
ACCAACTGGCAAGCCCTGCTCGCCCGCACCATCCAGCCCCCCTTCGTGCCTACCCTGTGT 
GGCCCTGCGGACCTGCGCTACTTTGAGGGCGAGTTCACAGGGCTGCCGCCTGCCCTGACC 
CCACCTGCACCCCACAGCCTCCTCACTGCCCGCCAACAGGCCGCCTTCCGGGACTTCGAC 
TTTGTGTCAGAGCGATTCCTGGAACCCTGA 

SEQ ID NO: 13_AI021023_M PKNBETA_M

GCTGAAGTGGGATAACCTTCTGCTGGATGCCCAGGGATTCCTGAAGATCGCAGACTTTGG 
ACTCTGCAAGGAAGGGATCGGCTTCGGGGACCGGACTAGCACCTTCTGTGGCACCCCGGA 
GTTCCTGGCTCCCGAGGTGCTGACCCAGGAGGCATACACACGGGCTGTGGACTGGTGGGG 
GCTGGGTGTGCTGCTCTACGAGATGCTGGTGGGTGAGTGCCCGTTCCCAGGGGACACAGA 
GGAAGAGGTGTTTGACTGCATCGTCAACATGGACGCCCCCTACCCCGGCTTTCTGTCGGT 
GCAAGGGCTTGAGTTCATTCAGAAGCTCCTCCAGAAGTGCCCGGAGAAGCGCCTCGGGGC 
GGGTGAGCAGGATGCCGAGGAGATCAAGGTCCAGCCATTCTTCAGGACCACCAACTGGCA 
AGCCCTGCTCGCCCGCACCATCCAGCCCCCCTTTGTGCCTACCCTGTGTGGCCCTGCGGA 
CCTGCGCTACTTTGAGGGCGAGTTCACAGGGCTGCCGCCTGCCCTGACCCCACCTGCACC 
CCACAGCCTCCTCACTGCCCGCCAACAGGCCGCCTTCCGGGACTTCGACTTTGTGTCAGA 
GCGATTCCTGGAACCCTGAGGGCATCTCCTGGCACCTCTGTCCCCTTCCCCCACAGACTG 
TTAGAGCCTCTGCTCGTTCACCCGTGCGCCCTGCCTGGAGGTCCAGGCCTTGCTGGGTAC 
TTCTGAGCCCTTGGGATTCAAAGTGGCAGCCATGGGGCCACTGTTGTGGGCTTTGCTCAG 
FIGURE 2K

TGTCACTGGGCAAAGTGTGTCCCTTCCCCCTCCAGCTCGCCCTCTTCTACCTCCCAGCGA 
GACCTGGCCCAGAAAGGGTGCCGCAGCAAGGAGTGATATGGTTTGTCTTTTTAAGACTGG 
AGTTGCTTTATATTAAATTTGTAAAAGTG 

SEQ ID NO: 14_H1S102_H 

GGTGGCAACATCCGGGGTCCCTGGGCCCGAGGCTGGAAGAGCCTCTGGACAGGTTTGGGA 
ACCATCAGGTCAGATCTGGAAGAAGTCTGGGAACTACGGGGGCAGCACTATCTGGAGGAG 
GAATCCCTAAAGCCAGCCCCAGTACTGGTAGAGAAGCCTCTGCCAGAGTGGCCAGTGCCT 
CAGTTCATCAACCTCTTTCTACCAGAGTTTCCCATTAGGCCCATTAGGGGGCAGCAGCAG 
CTGAAGATTTTAGGCCTCGTGGCTAAAGGCTCCTTTGGAACTGTCCTCAAGGTGCTAGAT 
TGCACCCAGAAAGCTGTATTTGCAGTGAAGGTGGTGCCGAAGGTAAAGGTCCTACAGAGG 
GATACCGTGAGGCAGTGCAAAGAGGAGGTTAGCATCCAGCGACAGATCAACCATCCCTTT 
GTACACAGCTTGGGGGACAGCTGGCAGGGAAAACGGCACCTTTTCATTATGTGTAGCTAC 
TGCAGCACAGATCTGTACTCCCTTTGGTCGGCTGTTGGCTGCTTTCCTGAGGCTTCCATC 
CGTCTCTTTGCTGCCGAGTTGGTGCTGGTACTGTGTTATCTCCATGACTTGGGCATCATG 
CATCGAGATGTGAAGATGGAGAATATTCTTCTAGATGAACGAGGCCATCTGAAACTGACA
GACTTTGGTCTGTCCCGCCACGTGCCCCAGGGAGCTCAAGCCTACACTATCTGTGGCACT 
CTTCAGTACATGGCCCCAGAGGTCCTAAGTGGAGGACCTTACAACCATGCTGCTGATTGG 
TGGTCCCTGGGTGTCTTGCTTTTCTCTGTGGCGACTGGAAAGTTTCCAGTGGCTGCAGAG 
AGAGATCATGTGGCCATGTTGGCAAGTGTGACCCACAGTGACTCTGAGATCCCAGCTTCT 
CTTAACCAGGGCCTCTCACTCCTGCTCCATGAGCTCTTATGCCAGAACCCCCTCCATCGT 
CTACGTTATCTGCATCACTTCCAGGTCCACCCTTTCTTTCGGGGTGTGGCCTTCGACCCA 
GAGCTCCTACAGAAGCAGCCAGTGAACTTTGTCACGGAGACACAAGCTACCCAGGCCAGT 
TCAGCGGAGACCATGCCCTTTGACGACTTTGACTGTGATGTGGAGTCCTTCTTGCTCTAC 

CCTATCCCTGCTTGA 

SEQ ID NO: 15_AA476563_H 

ATGGAATTCTTTAGGATAGACAGTAAGGATAGGGCAAGTGAAGTGCTGGGACTTGACTTT 
GGAGAAAAATTGTATAGTGTAAAATCAGAACCTTTGAAAGCATTCTTTACTGTTCCAGAT 
GGAGACAGTGGTTCTAGGAGTTTTAATACTAGTGAAAGCAAGGTAGAGTTTAAAGCTCAG 
GACACCATTAGCAGGGGCTCAGATGACTCAGTGCCAGTTATTTCGTTTAAAGATGCTGCT 
TTTGATGATGTCAGTGGTACTGATGAAGGAAGACCTGATCTTCTTGTAAATTTACCTGGT 
GAATTGGAGTGAACAAGAGAAGCTGCAGCAATGGGACCTACTAAGTTTACACAAACTAAT 
ATAGGGATAATAGAAAATAAACTCTTGGAAGCCCCTGATGTTTTATGCCTCAGGCTTAGT 
ACTGAACAATGCCAAGCACATGAGGAGAAAGGCATAGAGGAACTGAGTGATCCCTCTGGG 
CCCAAATCCTATAGTATAACAGAGAAACACTATGCACAGGAGGATCCCAGGATGTTATTT
GTAGCAGCTGTTGATCATAGTAGTTCAGGAGATATGTCTTTGTTACCCAGCTCAGATCCT 
AAGTTTCAAGGACTTGGAGTGGTTGAGTCAGCAGTAACTGCAAACAACACAGAAGAAAGC 
TTATTCCGTATTTGTAGTCCACTCTCAGGTGCTAATGAATATATTGCAAGCACAGACACT 
TTAAAAACAGAAGAAGTATTGCTGTTTACAGATCAGAGTGATGATTTGGCTAAAGAGGAA 
CCAACTTGTTTATTGCAGAGAGACTGTGAGAGTAAGGGTGAAAGTGGTTTAGTGGTAGAA 
GGAGACAAGGAAATACATCAGATTTTTGAGGACCTTGATAAAAAATTAGCACTAGCCTCC 
AGGTTTTACATCCCAGAGGGCTGCATTCAAAGATGGGCAGCTGAAATGGTGGTAGCCCTT 
GATGCTTTACATAGAGAGGGAATTGTGTGCCGGGATTTGAACGCAAAGAACATGTTATTG 
AATGATAGAGGACACATTCAGCTAACGTATTTTAGCAGGTGGAGTGAGGTTGAAGATTCC
TGTGACAGCGATGCCATAGAGAGAATGTAGTGTGCCCCAGAGGTTGGAGCAATCACTGAA 
GAAACTGAAGCCTGTGATTGGTGGAGTTTGGGTGCTGTCCTCTTTGAACTTCTCACTGGC 
AAGAGTGTGGTTGAATGCCATCGAGCAGGAATAAATAGTCAGACTACTTTGAACATGGCA 
GAATGTGTGTGTGAAGAGGCTCGCTCAGTCATTCAACAGCTGTTGCAGTTCAATCCTGTG 
FIGURE 2L

GAACGACTTGGTGCTGGAGTTGCTGGTGTTGAAGATATCAAATCTCATCCATTTTTTACC
CCTGTGGATTGGGCAGAAGTGATGAGATGA 

SEQ ID NO: 16_AA626690_H 

ATGCTACCATTCGCTCCTCAGGACGAGCCCTGGGACCGAGAAATGGAAGTGTTCAGCGGC 
GGCGGCGCGAGCAGCGGCGAGGTAAATGGTCTTAAAATGGTTGATGAGCCAATGGAAGAG 
GGAGAAGCAGATTCTTGTCATGATGAAGGAGTTGTTAAAGAAATCCCTATTACTCATCAT 
GTTAAGGAAGGCTATGAGAAAGCAGATCCTGCACAGTTTGAGTTGCTCAAGGTTCTTGGT 
CAGGGGTCATTTGGAAAGGTTTTTCTTGTTAGAAAGAAGACCGGTCCTGATGCTGGGCAG 
CTCTATGCAATGAAGGTGTTAAAAAAAGCCTCTTTAAAAGTTCGAGACAGAGTTCGGACA 
AAGATGGAGAGGGATATACTGGTGGAAGTAAATCATCCATTTATTGTCAAATTGCACTAT 
GCCTTTCAGACTGAAGGGAAACTGTACTTAATACTGGATTTTCTCAGGGGAGGAGATGTT 
TTCACAAGATTATCCAAAGAGGTTCTGTTTACAGAGGAAGATGTGAAATTCTACCTCGCA 
GAACTGGCCCTTGCTTTGGATCATCTGCACCAATTAGGAATTGTTTATAGAGACCTGAAG 
CCAGAAAACATTTTGCTTGATGAAATAGGACATATCAAATTAACAGATTTTGGACTCAGC 
AAGGAGTCAGTAGATCAAGAAAAGAAGGCTTACTCATTTTGTGGTACAGTAGAGTATATG 
GCTCCTGAAGTAGTAAATAGGAGAGGCCATTCCCAGAGTGCTGATTGGTGGTCATATGGT 
GTTCTTATGTTTGAAATGCTTACTGGTACTCTGCCATTTCAAGGTAAAGACAGAAATGAG 
ACCATGAATATGATATTAAAAGCAAAACTTGGAATGCCTCAATTTCTTAGTGCTGAAGCA 
CAAAGTCTTCTAAGGATGTTATTCAAAAGGAATCCAGCAAATAGATTGGGATCAGAAGGA 
GTTGAAGAAATCAAAAGACATCTGTTTTTTGCAAATATTGACTGGGATAAATTATATAAA 
AGAGAAGTTCAACCTCCTTTCAAACCTGCTTCTGGAAAACCAGATGATACTTTTTGTTTT 
GATCCTGAATTTACTGGAAAAACACCTAAAGATTCTCCCGGTTTGCCAGCCAGTGCAAAT 
GCTCATCAGCTCTTCAAAGGATTCAGCTTTGTTGCAACTTCTATTGCAGAAGAATATAAA 
ATCACTCCTATCACAAGTGCAAATGTATTACCAATTGTTCAGATAAATGGAAATGCTGCA 
CAATTTGGTGAAGTATATGAATTGAAGGAGGATATTGGTGTTGGCTCCTACTCTGTTTGC 
AAGCGATGCATACATGCAACTACCAACATGGAATTTGCAGTGAAGATCATTGACAAAAGT
AAGCGAGACCCTTCAGAAGAGATTGAAATATTGATGCGCTATGGACAACATCCCAACATT
ATTACTTTGAAGGATGTCTTTGATGATGGTAGATATGTTTACCTTGTTACGGATTTAATG 
AAAGGAGGAGAGTTACTTGACCGTATTCTCAAACAAAAATGTTTCTCGGAACGGGAGGCT 
AGTGATATACTATATGTAATAAGTAAGACAGTTGACTATCTTCATTGTCAAGGAGTTGTT 
CATCGTGATCTTAAACCTAGTAATATTTTATACATGGATGAATCAGCCAGTGCAGATTCA 
ATCAGGATATGTGATTTTGGGTTTGCAAAACAACTTCGAGGAGAAAATGGACTTCTCTTA 
ACTCCATGCTACACTGCAAACTTTGTTGCACCTGAGGTTCTTATGCAACAGGGATATGAT 
GCTGCTTGTGATATCTGGAGTTTAGGAGTCCTTTTTTACACAATGTTGGCTGGCTACACT 
CCATTTGCTAATGGCCCCAATGATACTCCTGAAGAGATACTGCTGCGTATAGGCAATGGA 
AAATTCTCTTTGAGTGGTGGAAACTGGGACAATATTTCAGACGGAGCAAAGGATTTGCTT 
TCCCATATGCTTCATATGGACCCACATCAGCGGTATACTGCTGAACAAATATTAAAGCAC 
TCATGGATAACTCACAGAGACCAGTTGCCAAATGATCAGCCAAAGAGAAATGATGTGTCA 
CATGTTGTTAAGGGAGCAATGGTTGCAACATACTCTGCCCTGACTCACAAGACCTTTCAA 

CCAGTCCTAGAGCCTGTAGCTGCTTCAAGCTTAGCCCAGCGACGGAGCATGAAAAAGCGA 
ACATCAACTGGCCTGTAA 

SEQ ID NO: 17_AA215680_H

ATGAGCCTGGTGGCCTGTGAGTGCCTGCCCAGCCCCGGCCTGGAGCCTGAGCCTTGCTCA 
CGAGCACGGTCCCAAGCTCACGTGTACCTGGAGCAGATTCGCAACAGGGTGGCTCTGGGA 
GTGCCTGACATGACAAAACGTGACTATCTGGTGGATGCGGCCACGCAGATCCGGCTGGCC 
CTGGAGCGCGATGTTAGTGAGGACTATGAGGCGGCCTTCAACCACTATCAGAATGGCGTG 
GACGTGCTGCTCCGTGGCATACACGTTGAGCCCAACAAGGAGCGACGTGAGGCTGTGAAG 
CTGAAAATTACCAAATACCTGCGGCGGGCAGAGGAGATCTTCAACTGCCACCTGCAGCGG 
FIGURE 2M



CCGCTGAGCAGTGGAGCCAGCCCCAGCGCGGGTTTCAGCAGCCTGAGGCTCCGGCCCATT 

CGCACGCTGAGCTCTGCGGTGGAGCAGCTGAGGGGCTGCAGGGTGGTCGGGGTGATCGAG 

AAGGTGCAGCTGGTCCAGGACCCGGCAACCGGAGGGACCTTTGTGGTGAAGAGCCTACCC 

AGGTGCCACATGGTGAGCAGGGAGCGGCTGACCATCATCCCACACGGAGTCCCCTACATG 

ACGAAGCTGCTCAGGTACTTTGTGAGCGAGGACTCCATCTTCCTGCACCTGGAGCATGTG 

CAAGGAGGCACTCTCTGGTCCCACCTGCTCTCCCAGGCGCACTCCCGACATTCTGGGCTC 

AGCTCTGGCTCTACCCAGGAGAGGATGAAGGCTCAGCTCAACCCCCACCTCAACCTCCTG 

ACCCCAGCGAGGCTTCCCTCAGGCCATGCCCCTGGCCAGGACAGAATCGCCCTGGAGCCT 

CCTAGGACTTCTCCGAACCTTCTCCTAGCTGGGGAGGCCCCATCCACCAGACCCCAGAGG 

GAGGCTGAAGGTGAACCCACAGCCAGGACCAGCACCTCTGGCTCCTCGGACCTTCCAAAG 

GCCCCAGGTGGCCACCTGCACCTTCAAGCTAGGAGGGCTGGCCAGAACTCAGACGCTGGG 

CCCCCTCGGGGGCTCACTTGGGTTCCTGAGGGGGCCGGCCCGGTGCTAGGGGGCTGTGGC 

CGAGGCATGGATCAGAGCTGCCTGTCAGCAGATGGGGCCGGCCGGGGCTGTGGCAGGGCC 

ACCTGGAGTGTGAGAGAGGAGCAGGTGAAGCAGTGGGCGGCAGAGATGCTGGTAGCGCTG 

GAGGCGCTGCACGAGCAGGGGGTGCTGTGCCGGGACCTCCACCCCGGGAACCTGCTCCTG 

GACCAGGCAGGTCACATCCGGCTCACATATTTTGGCCAGTGGTCAGAGGTGGAGCCCCAG 

TGCTGCGGGGAGGCCGTGGACAATCTCTACAGCGCCCCAGAGGTGGGTGGGATTTCCGAG 

CTGACGGAAGCCTGTGACTGGTGGAGCTTTGGGTCTCTACTGTATGAACTGCTGACGGGA 

ATGGCACTGTCCCAGAGCCACCCTTCAGGAATCCAGGCCCACACCCAGCTCCAGCTGCCC 

GAGTGGCTCAGTCGCCCAGCGGCCTCTCTGCTGACTGAGCTGCTGCAGTTCGAGCCTACC 

CGGCGCCTGGGCATGGGAGAAGGTGGTGTCAGCAAACTCAAGTCCCATCCCTTTTTCAGT 

ACCATCCAATGGAGCAAGCTGGTGGGGTAA 

SEQ ID NO: 18_SGK_H

ATGACGGTGAAAACTGAGGCTGCTAAGGGCACCCTCACTTACTCCAGGATGAGGGGCATG 

GTGGCAATTCTCATCGCTTTCATGAAGCAGAGGAGGATGGGTCTGAACGACTTTATTCAG 

AAGATTGCCAATAACTCCTATGCATGCAAACACCCTGAAGTTCAGTCCATCTTGAAGATC 

TCCCAACCTCAGGAGCCTGAGCTTATGAATGCCAACCCTTCTCCTCCACCAAGTCCTTCT 

CAGCAAATCAACCTTGGCCCGTCGTCCAATCCTCATGCTAAACCATCTGACTTTCACTTC 

TTGAAAGTGATCGGAAAGGGGAGTTTTGGAAAGGTTCTTCTAGCAAGACACAAGGCAGAA 

GAAGTGTTCTATGCAGTCAAAGTTTTACAGAAGAAAGCAATCCTGAAAAAGAAAGAGGAG 

AAGCATATTATGTCGGAGCGGAATGTTCTGTTGAAGAATGTGAAGCACCCTTTCCTGGTG 

GGCCTTCACTTCTCTTTCCAGAGTGCTGACAAATTGTACTTTGTCCTAGACTACATTAAT 

GGTGGAGAGTTGTTCTACCATCTCCAGAGGGAACGCTGCTTCCTGGAACCACGGGCTCGT 

TTCTATGCTGCTGAAATAGCCAGTGCCTTGGGCTACCTGCATTCACTGAACATCGTTTAT 

AGAGACTTAAAACCAGAGAATATTTTGCTAGATTCACAGGGACACATTGTCCTTACTGAT 

TTCGGACTCTGCAAGGAGAACATTGAACACAACAGCACAACATCCACCTTCTGTGGCACG 

CCGGAGTATCTCGCACCTGAGGTGCTTCATAAGCAGCCTTATGACAGGACTGTGGACTGG 

TGGTGCCTGGGAGCTGTCTTGTATGAGATGCTGTATGGCCTGCCGCCTTTTTATAGCCGA 

AACACAGCTGAAATGTACGACAACATTCTGAACAAGCCTCTCCAGCTGAAACCAAATATT 

ACAAATTCCGCAAGACACCTCCTGGAGGGCCTCCTGCAGAAGGACAGGACAAAGCGGCTC 

GGGGCCAAGGATGACTTCATGGAGATTAAGAGTCATGTCTTCTTCTCCTTAATTAACTGG 

GATGATCTCATTAATAAGAAGATTACTCCCCCTTTTAACCCAAATGTGAGTGGGCCCAAC 

GAGCTACGGCACTTTGACCCCGAGTTTACCGAAGAGCCTGTCCCCAACTCCATTGGCAAG 

TCCCCTGACAGCGTCCTCGTCACAGCCAGCGTCAAGGAAGCTGCCGAGGCTTTCCTAGGC

TTTTGCTATGCGCCTCCCACGGACTGTTTCCTCTGA 

SEQ ID NO: 19_AA107515_M

CGGGTCGACCCACGCGTCCGCCGGTTTCACTGCTCCCCTCAGTCTCTTTTGGGCTCTTTC 
CGGGCATCGGGACGATGACCGTCAAAGCCGAGGCTGCTCGAAGCACCCTTACCTACTCCA 
FIGURE 2N

GAATGAGGGGAATGGTAGCGATTCTCATCGCTTTTATGAAACAGAGAAGGATGGGCCTGA
ACGATTTTATTCAGAAGATTGCCAGCAACACCTATGCATGCAAACACGCTGAAGTTCAGT
CCATTTTGAAAATGTCCCATCCTCAGGAGCCGGAGCTTATGAACGCTAACCCCTCTCCTC
GGCCAAGTCCCTCTCAACAAATCAACCTGGGTCCGTCCTCCAACCCTCACGCCAAACCCT
CCGACTTTCACTTCTTGAAAGTGATCGGAAAGGGCAGTTTTGGAAAGGTTCTTCTGGCTA 
GGCACAAGGCAGAAGAAGTATTCTATGCAGTCAAAGTTTTACAGAAGAAAGCCATCCTGA 
AGAAGAAAGAGGAGAAGCATATTATGTCAGAGCGGAATGTTCTGTTGAAGAATGTGAAGC 
ACGCTTTCGTGGTGGGCCTTGACTTGTGATTCCAGACCGGTGACAAGCTCTAGTTTGTCC 
TGGAGTACATTAATGGTGGAGAGGTGTTGTACCATGTCCAGAGGGAGCGGTGCTTCCTGG 
AACGACGGGGTGGATTCTACGCAGGTGAAATAGGCAGTGGCCTGGGCTATGTGCACTCGC 
TAAACATCGTTTATAGAGACTTAAAACCTGAGAATATTCTCCTAGACTCCCAGGGGCACA 
TCGTCCTCACTGACNTATTTCAGCTGCGTAGAATCGAGCATAACGGGACAACATCTACCT 
TCTGTGGCACGCCTGAGTATCTGGCTCCTGAGGTCCTCCATAAGCAGCCGTATGACCGGA 
CGGTGGACTGGTGGTGTCTTGGGGCTGTCCTGTATGAGATGCTCTACGGCGTGGCCCCGT 
TTTATAGCCGGAACACGGCTGAGATGTACGACAATATTGTGAACAAGCGTCTGCAGTTGA 
AACCAAATATTACAAACTCGGCAAGGCACCTCCTGGAAGGCCTCCTGCAGAAGGACCGGA
CCAAGAGGCTGGGTGCCAAGGATGACTTTATGGAGATTAAGAGTCATATTTTCTTCTCTT 
TAATTAACTGGGATGATCTCATCAATAAGAAGATTACACCCCCATTTAACCCAAATGTGA 
GTGGGCCCAGTGACCTTCGGCACTTCGATCCCGAGTTTACCGAGGAGCCGGTCCCCAGCT 
CCATCGGCAGGTCCCCTGACAGCATCCTTGTCACGGCCAGTGTGAAGGAAGCAGCAGAAG 
CCTTCCTCGGCTTCTCCTATGCACCTCCTGTGGATTCCTTCCTCTGAGTGCTCCCGGGAT 
GGTTCTGAAGGACTTCCTCAGCGTTTCCTAAAGTGTTTTCGTTAGCCTTTGGTGGAGTTG 
CCAGCTGACAGAACATTTTAAAAGAATTTGCACACCTGGAAGCTTGGCAGTCTCGCCTGC 
CCGGCGTGGCGCGACGCAGCGCGCGCTGCTTGATGGGAGCTTTCCGAAGAGCACACCCTC 
CTCTCAATGAGCTTGTGAGGTCTTCTTTTCTTCTCTTCCTTCCAACGTGGTGCTAGCTCC 
AGGCGAGCGAGCGTGAGAGTGCCGCCTGAGACAGACACCTTGGTCTCAGTTAGAAGGAAG 
ATGCAGGTCTAAGAGGAATCCCCGCAGGTCTGTCTGAGCTGTGATCAAGAATATTCTGCA 

ATGTGCCTTTTCTGAGATCGTGTTAGCTCCAAAGCTTTTTCCTATCGCAGAGTGTTCAGT
TTGTGTTTGTTTGTTTTTGTTTTGTTTTGTTTTTCCCTTGGCGGATTTCCCGTGTGTGCA 
GTGGCGTGAGTGTGCTATGCCTGATCACAGACGGTTTTGTTGTGAGCATCAATGTGACAC 
TTGCAGGACACTACAATGTGGGACATTGTTTGTTTCTTCCACATTTGGAAGATAAATTTA 
TGTGTAGACTGTTTTGTAAGATATAGTTAATAACTAAAACCTATTGAAACGGTCTTGCAA 
TGACGAGCATTCAGATGCTTAAGGAAAGCATTGCTGCTACAAATATTTCTATTTTTAGAA 
AGGGTTTTTATGGACCAATGCCCCAGTTGTCAGTCAAAGCCGTTGGTGTTTTCATTGTTT 
AAAATGTCACCTATAAAACGGGCATTATTTATGTTTTTTTTCCCTTTGTTCATATTCTTT 
TGCATTCCTGATTATTGTATGTATCGTGTAAAGGAAGTCTGTACATTGGGTTATAACACT 

AGATATTTAAACTTACAGGCTTATTTGTAAACCATCATTTTAATGTACTGTAATTAACAT 
GGGTTATAATATGTACAATTCCTCCTCCTTACCACACAACTTTTTTTGTGTGCGATAAAC 
CAATTTTGGTTTGCAATAAAATCTTGAAAACT 

SEQ ID NO: 20_AA109508_M 

CCACCTGCAGCGGGAGCGCCGGTTCGTGGAGCCCCGGGCCAGGTTCTACGCTGCTGAGGT 
GGCCAGCGCCATTGGCTACCTGCACTCCCTCAACATCATTTACAGGGATCTGAAACCAGA 
GAACATTCTCTTGGACTGCCAGGGACACGTGGTGCTGACGGATTTTGGCCTCTGCAAGGA 
AGGTGTAGAGCCTGAAGACACCACATCCACATTCTGTGGTACCCCTGAGTACTTGGCACC 
TGAAGTGCTTCGGAAAGAGCCTTATGATCGAGCAGTGGACTGGTGGTGCTTGGGGGCAGT 
CCTCTACGAGATGCTCCATGGCCTGCCGCCCTTCTACAGCCAAGATGTATCCCAGATGTA 
TGAGAACATTCTGCACCAGCCGCTACAGATCCCCGGAGGCCGGACAGTGGCCGCCTGTGA 
CCTCCTGCAAAGCCTTCTCCACAAGGACCAGAGGCAGCGGCTGGGCTCCAAAGCAGACTT 
TCTTGAGATTAAGAACCATGTATTCTTCAGCCCCATAAACTGGGATGACCTGTACCACAA
FIGURE 2O

GAGGCTAACTCCACCCTTCAACCCAAATGTGACAGGACCTGCTGACTTGAAGCATTTTGA 
CCCAGAGTTCACCCAGGAAGCTGTGTCCAAGTCCATTGGCTGTACCCCTGACACTGTGGC 
CAGCAGCTCTGGGGCCTCAAGTGCATTCCTGGGATTTTCTTATGCGCCAGAGGATGATGA 
CATCTTGGATTGCTAGAAGAGAAGGACCTGTGAAACTACTGAGGCCAGCTGGTATTAGTA 
AGGAATTACCTTCAGCTGCTAGGAAGAGCGACTCAAACTAACAATGGCTTCAACGAGAAG 
CAGGTTTATTTTTTCCAGCACATAAAAGAAAAATAATGTTTCGGAGTCCAGGACTGGCAG 
GACAGGTCATCAGATACTCAGAGGCTGTATCTCTGCCCTGCCAACCTTGACAAATGGCTT 
CCAATGTTAGGTTTGCTACAAGATGGTTACTGGAGCTCTAGCTGCCTATTTTGTGTTTAG 
GGAAGGGAAAATGGAGGAAAGGGGAGAAGAGCAAAGGGCGCTTTTAAAGAGCTTTCCCAA 
AAGCTCCACCCAATGACTTCTGCTTCCATCTCACTAACCACCCACCCCTACCTGGAATGG 
AGGCTGGGAGATGTGGCTTATTTGCTGGGTACGTGACTATCCCTAATAACAAAGGGGTTC 
TGACACTAAGACATTAGGGGAGAATGTTGGGTAGGCAGCCAGCACTCTTTTACCAGAGGG 
CCTCCTGGTGTTTGGATTTTGATCTCAATGTGTAAAATGACAGAGATGTAACAAGCTCAT 

AGGGTATCAATATCTCTTATTGTTCT

SEQ ID NO: 21_AA887783_H

CGGATGCATTTNTTGGTGTGCTCTTGAGGGATTAAATGCAAAGAGATCACACCATGGACT 
ACAAGGAAAGCTGCCCAAGTGTAAGNATTCCCAGCTCCGATGAACACAGAGAGAAAAAGA 
AGAGGTTTACTGTTTATAAAGTTCTGGTTTCAGTGGGAAGAAGTGAATGGTTTGTCTTCA 
GGAGATATGCAGAGTTTGATAAACTTTATAACACTTTAAAAAAACAGTTTCCTGCTANGG 
CCCTGAAGATTCCTGCCAAGAGAATATTTGGTGATAATTTTGATCCAGATTTTATTAAAC 
AAAGACGAGCAGGACTAAACGAATTCATTCAGAACCTAGTTAGGTATCCAGAACTTTATA 
ACCATCCAGATGTCAGAGCATTCCTTCAAATGGACAGTCCAAAACACCAGTCAGATCCAT 
CTGAAGATGAGGATGAAAGAAGTTCTCAGAAGCTACACTCTACCTCACAGAACATCAACC 
TGGGACCGTCTGGAAATCCTCATGCCAAACCAACTGACTTTGATTTCTTAAAAGTTATTG 
GAAAAGGCAGCTTTGGCAAGGTTCTTCTTGCAAAACGGAAACTGGATGGAAAATTTTATG 
CTGTCAAAGTGTTACAGAAAAAAATAGTTCTCAACAGAAAAGAGCAAAAACATATTATGG 
CTGAACGTAATGTGCTCTTGAAAAATGTGAAACATCCGTTTTTGGTTGGATTGCATTATT 
CCTTCCAAACAACTGAAAAGCTTTATTTTGTTCTGGATTTTGTTAATGGAGGGGAGGGAC 
ATGTTGTCTTAACAGATTTTGGGCTTTGTAAAGAAGGAATTGCTATTTCTGACACCACTA 
CCACATTTTGTGGGACACCAGAGTATCTTGCACCTGAAGTAATTAGAAAACAGCCCTATG 
ACAATACTGTAGATTGGTGGTGCCTTGGGGCTGTTCTGTATGAAATGCTGTATGGATTGC 
CTCCTTTTTATTGCCGAGATGTTGCTGAAATGTATGACAATATCCTTCACAAACCCCTAA 
GTTTGAGGCCAGGAGTGAGTCTTACAGCCTGGTCCATTCTGGAAGAACTCCTAGAAAAAG 
ACAGGCAAAATCGACTTGGTGCCAAGGAAGACTTTCTTGAAATTCAGAATCATCCTTTTT 
TTGAATCACTCAGCTGGGCTGACCTTGTACAAAAGAAGATTCCACCACCATTTAATCCTA 
ATGTGGCTGGACCAGATGATATCAGAAACTTTGACACAGCATTTACAGAAGAAACAGTTC 
CATATTCTGTGTGTGTATCTTCTGACTATTCTATAGTGAATGCCAGTGTATTGGAGGCAG 
ATGATGCATTCGTTGGTTTCTCTTATGCACCTCCTTCAGAAGACTTATTTTTGTGAGCAG 
TTTGCCATTCAGAAACCATTGAGCAAAATAAGTCTATAGATGGGACTGAAACTTCTATTT 
GTGTGAATATATTCAAATATGTATAACTAGTGCCTCATTTTTATATGTAATGATGAAAAC 
TATGAAAAAATGTATTTTCTTCTATGTGCAAGAAAAATAGGGCATTTCAAAGAGCTGTTT 
TGATTAAAATTTATATTCTTGTTTAATAAGCTTATTTTTAAACAATTTAAAAGCTATTAT 
TCTTAGCATTAACCTATTTTTAAAGAAACCTTTTTTGCTATTGACTGTTTTTTCCCTCTA 
AGTTTACACTAACATCTACCCAAGATAGACTGTTTTTTAACAGTCAATTTCAGTTCAGCT 
AACATATATTAATACCTTTGTAACTCTTTGCTATGGCTTTTGTTATCACACCAAAACTAT 
GCAATTGGTACATGGTTGTTTAAGAAGAAACCGTATTTTTCCATGATAAATCACTGTTTG 
AAATATTTGGTTCATGGTATGATCGAAATGTAAAAGCATAATTAACACATTGGCTGCTAG 
TTAACAATTGGAATAACTTTATTCTGCAGATCATTTAAGAAGTAACAGGCCGGGCGCGGT
GGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGCAGATCACCTGAGGTCA 
GGAGTTGGAGACCAGCCTGACCAACATGGACAAACCCCGTCTCTACTAAAAATACAAAT'i'I'
TGGCAGGGTGTGGTGGCACATGCCTATAATCCCAGCTACTTGGGAGGCTAAGGCAGGAGA 
ATCGCTTGAACCCGGGAGGCGGAGGTTGCAGTGAGCCGAGATCGCACCATTGCACTCCTG
CCTGGGCAACAAGAGTGAAACTCCATCTCC 

SEQ ID NO: 22_R4780B_H 

ATGGCGCACCAAACGGGCATCCACGCCACGGAAGAGCTGAAGGAATTCTTTGCCAAGGCA 
CGGGCTGGCTCTGTGCGGCTCATCAAGGTTGTGATTGAGGACGAGCAGCTCGTGCTGGGT 
GCCTCGCAGGAGCCAGTAGGCCGCTGGGATCAGGACTATGACAGGGCCGTGCTGCCACTG 
CTGGACGCCCAGCAGCCCTGCTACCTGCTCTACCGCCTCGACTCACAGAATGCTCAGGGC 
TTCGAATGGCTCTTCCTCGCCTGGTCGCCTGATAACTCCCCCGTGCGGCTGAAGATGCTG 
TACGCGGCCACGCGGGCCACAGTGAAAAAGGAGTTTGGAGGTGGCCACATCAAGGATGAG 
CTCTTCGGGACTGTGAAGGATGACCTCTCTTTTGCTGGGTACCAGAAACACCTGTCGTCC 
TGTGCGGCACCTGCCCCGCTGACCTCGGCTGAGAGAGAGCTCCAGCAGATCCGCATTAAC 
GAGGTGAAGACAGAGATCAGTGTGGAAAGCAAGCACCAGACCCTGCAGGGCCTCGCCTTC 
CCCCTGCAGCCTGAGGCCCAGCGGGCACTCCAGCAGCTCAAGCAGAAAATGGTCAACTAC 
ATCCAGATGAAGCTGGACCTAGAGCGGGAAACCATTGAGCTGGTGCACACAGAGCCCACG 
GATGTGGCCCAGCTGCCCTCCCGGGTGCCCCGAGATGCTGCCCGCTACCACTTCTTCCTC 
TACAAGCACACCCATGAGGGCGACCCCCTTGAGTCTGTAGTGTTCATCTACTCCATGCCG 
GGGTACAAGTGCAGCATCAAGGAGCGAATGCTCTACTCCAGCTGCAAGAGCCGCCTCCTC 
GACTCCGTGGAGCAGGACTTCCATCTGGAGATCGCCAAGAAAATTGAGATTGGCGATGGG 
GCAGAGCTGACGGCAGAGTTCCTCTACGACGAGGTGCACCCCAAGCAACACGCCTTCAAG 
CAGGCCTTCGCCAAGCCCAAGGGCCCAGGGGGCAAGCGGGGCCATAAGCGCCTCATCCGC 
GGCCCGGGTGAAAATGGGGATGACAGCTAG 

SEQ ID NO: 23_H60215_H 

CCACGCGTCCGGCGCCGCAGCCATGGAGGGAGGCGGCGGCGGCGGCGGCGGCGGCTCGGG 
TGGCTGCGCTGGGAGGCGGCGGTGAGAGGCTCGCACGCCTCCAGCCCGGCCCCGGCCCCC

CGGGAGGGAGAGCCGAGCAGCCCCGGCTCTGGGCTACGGACTATGGGCGAATAGCTCTGA 
CCACCCGGCGAAGTGCACACACCCAGAAGCTATGTCCTTCGGCAGTAAAAGTTTTACAGC 
ACAATATATGTGCTCTGCTCTCCTCCCGCAATCCTGCTCCAAGAGATCTTAAGCTGGAGG 
CACCAGGTCTGAATTCCAGACTCCTCCCCACCACCCACACTTCACCTCCAACTGGAGCAT 
GACCACAGACCCATTCAGGGAGGCTGGCGGACTCTTCATCCTGGACAGTCCCTTACTGTA 
TGTCAAAGCTGAGAATGAAGCGGAGAGCATCAGACAGAGGAGCTGGGGAAACGTCGGCCA 
GGGCCAAGGCTCTAGGAAGTGGGATTTCTGGAAATAATGCAAAGAGAGCTGGACCATTCA 
TCCTTGGTCCCCGTCTGGGCAACTCACCGGTGCCAAGCATAGTGCAGTGTTTGGCGAGGA 
AAGATGGCACGGATGACTTCTATCAGCTGAAGATCCTGACCCTGGAGGAGAGGGGGGACC 
AAGGCATAGAGAGCCAGGAAGAGCGGCAGGGCAAGATGCTGCTGCACACCGAGTACTCAC 
TGCTGTCTCTCCTGCACACGCAGGATGGCGTGGTGCACCACCACGGCCTCTTCCAGGACC 
GCACCTGTGAAATCGTTGAGGACACAGAATCCAGCCGGATGGTTAAGAAGATGAAGAAGC 
GCATCTGCCTCGTCCTGGACTGCCTCTGTGCTCATGACTTCAGCGATAAGACCGCTGACC 
TCATCAACCTGCAGCACTACGTCATCAAGGAGAAGAGGCTCAGCGAGAGGGAGACTGTGG 
TAATCTTCTACGACGTGGTCCGCGTGGTGGAGGCCCTGCACCAGAAAAATATCGTGCACA 
GAGACCTGAAGCTGGGGAACATGGTGCTCAACAAGAGGACACATCGGATAACCATCACCA 
ACTTCTGCCTCGGGAAGCATCTGGTGAGCGAGGGGGACCTGCTGAAGGACCAGAGAGGGA 
GCCCTGCCTACATCAGTCCCGACGTGCTCAGCGGCCGGCCGTACCGTGGCAAGCCCAGTG 
ACATGTGGGCCCTGGGCGTGGTGCTCTTCACCATGCTGTATGGCCAGTTCCCCTTCTACG 
ACAGCATCCCGCAGGAGCTCTTCCGCAAGATCAAGGCTGCCGAGTATACCATTCCTGAGG 
ATGGACGGGTTTCTGAGAACACCGTGTGTCTCATCCGGAAGCTGCTGGTCCTTGACCCCC 
AGCAGCGCCTGGCCGCCGCCGACGTCCTGGAGGCCCTCAGTGCCATCATTGCATCATGGC 
AGTCCCTGTCATCTCTGAGTGGGCCTTTGCAAGTGGTTCCTGACATTGATGACCAAATGA
GCAATGCGGATAGCTCCCAGGAGGCGAAGGTGACGGAGGAGTGCTCCCAGTACGAGTTTG 
AGAACTACATGCGTCAGCAGCTGCTGCTGGCCGAGGAGAAGAGCTCCATCCATGACACCC 
GGAGCTGGGTACCCAAGCGGCAGTTCGGCAGCGCACCACCGGTGCGACGGCTGGGCCACG 
ACGCACAGCCCATGACCTCCTTGGACACGGCCATCCTGGCGCAGCGCTACCTGCGGAAAT 
AACAGCCTCAGCCGGGGCCACCAGCACTGCTGCCACTTCTTCCAGCCCCAGCCAAAGGCG 
TGGCTGTCAGGGCTGGGCCCTGTAGTGCTGGACTCTCCCGGGCCACAATAGGGACAGGGC 
AGGGACAGGGACAGCCCAGGTCACACGTGGGGTCAGCAGAGGTACCACGAAGCTACCTTT 
TGGGATGATTGCTCGATTGTTTGGTTTTTAAATCTGAGAAGCCTAGATAACTAATCTGCT 
TTTAATGACGATGTTTTAATCTACCTCTGTCTCTTTAACCATGCTGTCTCTGGACTGAGC 
AAGAGGGAGGAGGGAGCGTGCTCACCCCAGTCCAGGGGCTTCCCGAGGGGCCAGCAACTG 
ACGTGGGGCGCTGCTCCCCACAGTCCAAATAAGCTGAAAGTGCAGGTCGCTGCAGGCCCG 
AGAGCGAGCTTCCCCTGGTCCCTGCTCTCCCAGGCCCCTGCCAGAGCCTCTTTGCGTCCG 
TCTCTTTCTGATCCAGGCCCCTCAGTCCAAGCTTTGGAAAACCTTCACCTCATCTTAAAC 
CAAACTCAAATATATTTATTTTTTTACCAT 

SEQ ID NO: 24_SGK324_H 

GCCGCGATGGCCAGCACCAGGAGTATCGAGCTGGAGCACTTTGAGGAACGGGACAAAAGG 
GCGCGGCCGGGGTCGCGGAGAGGGGCCCCCAGCTCCTCCGGGGGCAGCAGCAGCTCGGGC 
GCCAAGGGGAACGGGCTCATCCCCAGTCCGGCGCACAGTGCCCACTGCAGCTTCTACCGC 
ACGCGGACCCTGCAGGCCCTCAGCTCGGAGAAGAAGGCCAAGAAGGCGCGCTTCTACCGG 
AACGGGGACCGCTACTTCAAGGGCCTGGTGTTTGCCATCTCGAGCGACCGGTTCCGGTCG 
TTCGATGCGCTCCTCATAGAGCTCACCCGGTCCCTGTCGGACAACGTGAACCTGCCCCAG 
GGTGTCCGCACTATCTACACCATCGACGGCAGCCGGAAGGTGACCAGCCTGGAGGAGCTG 
GTGGAAGGTGAGAGTTACGTGTGTGCATCCAATGAAGGATTTGGTAAAGTGGATTACACG 
AAAAATATTAATCCAAACTGGTCTGTGAACATCAAGGGTGGGACATCCCGAGCGCTGGCT 
GGTGCCTCCTCTGTGAAAAGTGAAGTAAAAGAAAGTAAAGATTTGATCAAACCCAAGTTA 
GTGACTGTGATTCGAAGTGGAGTGAAGCCTAGAAAAGCGGTGCGGATGCTTGTGAATAAA 
AAGACTGCTCATTCCTTTGAACAAGTGTTAACAGATATCAGCGAAGCCATTAAACNAGCC 
TCAGGAGTCGTCAAGAGGCTGTGGAGGCTGGATGGAAAGCAGGTGAGAGTTACGTGTGTG 
CATCTGCCAGACTTTTTTGGTGATGACGATGTTTTTATTGGATGTGGAGGAGAAAAATTT 
GGTTATGCCCAAGATGAGTTTGTGCTGGATCATAGTGAATGTGGTGTGCTGAAGTCATCT 
TATTCTCGATCCTCAGCTGTTAAGTATTCTGGATCCAAAAGCGGTGGGGGCTCTCGACGG 
AGCCAGATTTCTGGTCATGGCAGATCTTCTTCCAATGTAAAGGGTGGACCTGAGCTTGAG 
CGTTGCATAAGTCCTGAAGGTGTGAATGGAAACAGATGGTCTGAATCATCAAGTCTTGTT 
GAGAAATACAAAATTGGAAAGGTCATTGGTGATGGCAATTTTGCAGTAGTCAAAGAGTGT 
ATAGACAGGTCCACTGGAAAGGAGTTTGCCCTAAAGATTATAGACAAAGCCAAATGTTGT 
GGAAAGGAACACGTGATTGAGAATGAAGTGTCAATACTGGGGCGAGTGAAACATGCCAAT
ATCATTATGCTGGTGGAGGAGATGGAAACAGCAACTGAGCTCTTTCTGGTGATGGAATTG
GTCAAAGGTGGAGATCTGTTTGATGCAATTACTTCGTCGACCAAGTACACTGAGAGAGAT 
GGGAGTGCCATGGTGTACAACTTAGCCAATGCCCTCAGGTATGTCCATGGCCTCAGCATC 
GTGGACAGAGACATCAAACCAGAGAATCTGTTGGTGTGTGAATATCCTGATGGAAGCAAG 
TCTTTGAAACTGGGAGACTTTGGGCTTGCGACTGTGGTAGAAGGCCCTTTATAGACAGTC 
TGTGGCACACCCACTTATGTGGCTCCARAAATCATTGCTGAAACTGGCTATGGCCTGAAG 
GTGGACATTTGGGCAGCTGGTGTGATCACATACATACTTCTCTGTGGATTCCCACCATTC 
CGAAGTGAGAACAATGTCCAGGAAGATGTGTTCGAGCAGATCTTGGCTGGGAAGGTGGAG 
TTTCCGGCCCCCTAGTGGGATAACATCACGGACTGTGCCAAGGAATTAATCAGTCAAATG 
CTTCAGGTAAATGTTGAAGCTCGGTGTACCGCGGGACAAATCCTGAGTCACCCCTGGGTG 
TCAGATGATGCCTCCCAGGAGAATAACATGCAAGCTGAGGTGACAGGTAAACTAAAACAG 
CAGTTTAATAATGGGCTCGCCAAACAGAACAGCAGTACGACCGGGGTGTCCGTCATCATG 
GTGAGTGGAAGGCGGCAGGTCTGGCCTGACTGCQGAGCCGGCCTTGAAGTTTTTGAATTA

GGTAGCCGGGAGCTGCGCTCACATGGAAGTTGGTGCCTTCCGTAGTCCTATTTCATATGA 

AGATTGGGTTGGGATGTGGAGGGGACTCATTCGGCAACTCCCAGGCTTTGGGCACTGTGT 

GGAGGGGCTTGTGTAGGGACCAGCAGGCCTGGTGTGAGGGGTGCAGGCGTGAAGGAGCTC 

CTGGCTGGGCCCTCTGGGCAGCTGCTTCCACTCTTGTCTCTGCCTTCTCATCTAGAGAGA 

CTCCCAAGCCCTGGAGGGGTGTGTTGTGTTAGGAATTAACTCCCTGCCTACCCCAAGGCC 

TCAGAAATAGATTATTAGAGATGTGAATTATTCTTTGAGACTTGGGATAAGAAACAGCCA 

AAGCTAAACATATTTCAGTTTTAAAAAATCAGTGTTTTATAAAACACAGTTTGGGGCTTT 

TAAAGGTACATAATCAAGGAAAAAAATATATATTCATTTTTCAGGGTTGGTAACATTTTA 

TGAGATGTCAGTGACAACGATGGCCTTATTTTTTTCAGCCTTTTCTTCTTGCAAAATGTT 

TCTTAAGGCAACTCTCCTAAATACATAAACACAACAAATTAAAATGAAAAGTGACATGAG 

AGTAAATGAATCAAAAGGAAAAAACATTGAACGAGAGGTGAGGGCAGCAGAGCGGCAGCA

GGTGTCCAGGGCTGAGCCAATGCAACCCTGGGCGGGAAGGCCAGCTGACGGTGAGCAGGT 

AGAAGCCAGCCAGCCACCCAGGCAGGGACCTTGGTTCTCCCGACACACTCCCAGGAGCAG 

GGAACAGGGGTGGAGTGGCCTTTCCCAGAGCTGGAGTTGGCTGCAGCAGCTTTCGAATCA 

GACCTGCCAAGGTGATGGGCGTCTGAGTTTCACATGTGGGGCCCCCGTGACCGCACTGAG 

TCCTGACAGCTAAGGATGGGCCACCTCCACAGCTCCGTCACTCGTACTTGGGACAGGCCT 

CTCATGCTCTGGGAAGGTCCTCCTTGTTTCCTACCCAACTAGAAGGGAAACAGTGGCATA 

TTCTCATGGTACATGGTTGTCTGAAAGCCTTACGTAGGAAGACGCAGGGTCTAGATAGAA 

GCTATAAGGAAGCCACACACATAACCCACATCCCCACACCGCCAACATGCGCCACAGTGC 

CCACACCCCCCACACCCCCCACATCCCCACCATAATTACCCGCAGCTCCAAATATCTCAT 

SEQ ID NO: 25_W30246_M SGK324_M 

ACCAAGTCCTGCAGCTCCTCTCCAACCAGGCCGGGAAGTTTCAGAGGATTGAAGATTTCT 
GCTCAGGGGAGATCTTCTTGCAACGTAAACGGTGGGGCTGAACTTGACCGTTGGCTGAGC 
CCTGAAGGTGTGAATGGAAACCGGTGCTCGGAGTGGTTCGCGCTTCTGGAGAAATAGAGA 
ATAGGGAAGGTCATCGGGGACGGGAACTTGGCGGTAGTTAAGGAGTGGGTGGACAGGTAC 
ACTGGAAAAGAGTTTGCATTAAAGATTATAGAGAAAGGCAAATGCTGTGGAAAGGAGCAT
CTGATTGAGAACGAAGTGTCAATGCTGCGCCGAGTGAAGCACCCCAACATCATCATGTTG 
GTTGAAGAGATGGAAAGAGCAACTGACCTCTTTCTAGTGATGGAACTGGTCAAAGGTGGA 
GATCTCTTTGATGCGATTACCTCTTCAACCAAGTACAGTGAGAGAGATGGAAGCGCCATG 
GTGTACAACCTAGCCAATGCCCTCCGGTACCTGCACAGCCTCAGCATCGTCCACAGGGAC 
ATCAAGCCTGAGAATCTGCTGGTGTGCGAATACCCAGATGGAACCAAGTCTTTGAAGCTG 
GGAGACTTTGGGCTGGCGACGGTGGTTGAAGGCCCGTTGTACACGGTCTGTGGCACGCCA 
ACTTATGTGGCACCAGAGATCATAGCTGAAACAGGTTATGGCCTGAAGGTGGATGTTTGG 
GCAGCTGGTGTGATTACATACATACTTCTCTGTGGATTCCCACCATTCCGGAGTGAGAAC 
AATCTCCAGGAAGATCTCTTTGACCAGATCTTGGCTGGAAAGCTGGAATTCCCAGCCCCC 
TACTGGGACAACATTACAGACTCTCCTTGTGTGTGTTTTAGGAAATGCTTATGAAGCTGG 
CCCGTGGGCTTCCCAGTGGGACGTGCAGCAGTTCTTGGCAGAGCAGGGCCAGCTCTGCTG 
TGTCATCTCCAGGGTCTCCCATCACCTCTGCTCTTTGCCATGGCAGGTCTGCTGAGACCC 

CGCGGGGACGGGGGCATGGTGCTCCCTGATTGGCCTGTGACCAACCTTCTGGAAGGCTGC 
TGGCAGTTTTCCCTGTTTTCCACCACCCCACTCTTTTTAATAATTGTATATAACTGTACT 
TGTTCTACTTGCTTGTCTTTAAAACAGGGGCCCCCACAGTTCACTCTCACTGTTAGATTT 
TGCCTTTTCCAGGTATCCCCAACCTGCAATAAACTCTTCCCTCTTCAG 

SEQ ID NO: 26_AA383293_H 

CCAGCAGCCAAGAGGGTAGTGGTGTACCGGAATGGGGACCCATTCTTCCCAGGCTCCCAG 
CTGGTGGTGACTCAACGCCGCTTCCCCACCATGGAGGCCTTCCTCTGCGAGGTGACATCA
GCTGTGGAGGCCCCACTGGCTGTGCGTGCCCTCTACACACCTTGTCATGGCCACCCTGTC 
ACCAACCTGGCAGACTTGAAGAACAGAGGGGAGTATGTGGCCGCTGGATTTGAACGATTC
CACAAGCTCCCCCCTTACCAGGCTTTTTGTCTCAGTGTGTTCAGGAATGGGGACCTGGTA
AGTCCCCCATTTAGTGTGAAGGTGTCGCAGGCTGCCAGCCAGGACTGGGAAACTGTGTTG
AAGCTCCTGACTGAGAAGGTCAAGTTGCAGAGTGGGGCTGTGAGACTCTGCACCCTAGAG 
GGGCTCCCACTGTCAGCAGGGAAGGAGCTGGTAACTGGCCATTACTATGTGGCTGTCGGA
GAGGATGAGTTCAAGGACCTTCCCTATCCAGCTCTGTCCACAAGAGGGCTCCTGGCAGCA 
GGAAGGAAGGCTAAGAAGGAGACATGCCTAATCGTGACCCTGACCCTGAAATACCAGCAG

TCAGAAACAAGCAGAGACGGGCAATCATTCCCATCAGGAGTTATAGGAGTATATGGAGCT 

CCCCACCGAAGGAAGGAGACAGCGGGGGCCCTGGAAGTAGCAGATGATGAAGACACTCAG 

ACAGAGGAGCCCTTGGATGAGAGGGCAGCACAGATAGTGGAACAGGTTACTTGTCTGCAA 

GACTTTTTTGGTGATGACGATGTTTTTATTGCATGTGGACGAGAAAAATTTCGTTATGGC 

CAAGATGACTTTGTCCTGGATCATAGTCGTCGACGGCTGCTGAGAGAGCACCAGGCGGGC 

TTTGAGAAGCTCCGCAGGACCCGAGGAGAAGAGAAGGAGGCAGAGAAGGAGAAAAAGCCA

TGTATGTCTGGAGGGAGAAGGATGACTCTCAGAGATGACCAACCTGCAAAGCTAGAAAAG 

GAGCCCAAGACGAGGCCAGAAGAGAACAAGCCAGAGCGGCCCAGCGGTCGGAAGCCACGG 

CCCATGGGCATCATTGCCGCCAATGTGGAAAAGCATTATGAGACTGGCCGGGTCATTGGG 

GATGGGAACTTTGCTGTCGTGAAGGAGTGCAGACACCGCGAGACCAGGCAGGCCTATGCG 

ATGAAGATCATTGACAAGTCCAGACTCAAGGGCAAGGAGGACATGGTGGACAGTGAGATG 

TTGATCATCCAGAGCCTCTCTCACCCCAACATCGTGAAATTGCATGAAGTCTACGAAACA 

GACATGGAAATCTACCTGATCCTGGAGTACGTGCAGGGAGGAGACGTTTTTGACGCCATC 

ATAGAAAGTGTGAAGTTCCCGGAGCCCGATGCTGCCCTCATGATCATGGACTTATGCAAA 

GCCCTCGTCCACATGCACGACAAGAGCATTGTCCACCGGGAGCTCAAGCCGGAAAACCTT 

TTGGTTCAGCGAAATGAGGACAAATCTACTACCTTGAAATTGGCTGATTTTGGACTTGCA 

AAGCATGTGGTGAGACCTATATTTACTGTGTGTGGGACCCCAACTTACGTAGCTCCCGAA 

ATTCTTTCTGAGAAAGGTTATGGACTGGAGGTGGACATGTGGGCTGCTGGCGTGATCCTG 

TATATCCTGCTGTGTGGCTTTCCCCCATTCCGCAGCCCTGAXXGAGGGGACCAGGACGAG 

CTCTTTAACATCATCCAGCTGGGCCACTTTGAGTTCCTCCCCCCTTACTGGGACAATATC 

TCTGATGCTGCTAAAGATCTGGTGAGCCGGTTGCTGGTGGTAGACCGCAAAAAGCGCTAC 

ACAGCTCATCAGGTTCTTCAGCACCCCTGGATCGAAACAGCTGGCAAGACCAATACAGTG 

AAACGACAGAAGCAGGTGTCCCCCAGCAGCGATGGTCACTTCCGGAGCCAGCACAAGAGG 

GTTGTGGAGCAGGTATCATAGTCACCACCTTGGGAATCTGTCCAGCCCCCAGTTCTGCTC 

AAGGACAGAGAAAAGGATAGAAGTTTGAGAGAAAAACAATGAAAGAGGCTTCTTCACATA 

ATTGGTGAATCAGAGGGAGAGAGACTGAGTATATTTTAAAGCATATTAAAAAAATTAAGT 

CAATGTTAAATGTCACAACATATTTTTAGATTTGTATATTTAAAGCCTTTAATACATTTT 

TGGGGGGTAAGCATTGTCATCAGTGAGGAATTTTGGTAATAATGATGTGTTTTGCTTCCC 

CTTTGTAACCAAGTTTATTCTGTACTACAGGAGTGGTGCTTACCAGGGTCTAAACTCCCG 

CTGTGAGATTAATAAGGTGCATTG 

SEQ ID NO: 28_AA197883_M

ATGCCAACCGCGCCGGTCCTGCGCGCGCCGCCGCCGCCAGCGACCCCCGCCCCGCCGGCA
CCCAGTCGCCCTGCGCCTCGCATTGCGGGGCACCGAGGCCCATGTGACCATTCTCTGAAA 
TGCTTAAGCTCGAAGATCTCTGAGAGAAAGCTGCCAGGCCCCTGGTTACCTGCGGGAGGA 
GGACCTCTGGAGAAGCCAGTTCTGGGGCCACGTGGTGCCGTCATGGCGCTGTTCAGCCCT 
CAGAGCAGCCTCCACTCAGTCCGCGCAGAGCACAGCCCACTGAAGCCCAGGGTGGTGACG 
GTGGTGAAGGTGGGTGGGGAGCCCCTCCGTAAGGGCACCCTGGTGGTCAACGGGCGGTCA 
GTGCAGACCTTTGAGCAGCTCCTATCAGACATCTCCGAAGCGTTGGGGTTCCCAGGGTGG 
AAGAACGACCGTGTGCGGAAGCTGTTCACCCTCAAGGGCAGGGAGGTGAAGAGTGTGTCT 
GACTTCTTCCGGGAGGGTGATGCTTTCATAGCTATGGGCAAAGAGCCGCTGACATTGAAG 
AGTATCCAGTTGGCCATGGAGGAGCTGTATCCTAAGAACCGGGCTCTTGCCCTGGCCCCT 
CACAGTAGAGTCGCCTCCCCAAGGCTGAGAAGCAGAGTTGGCAGGAAGCTTGTGAAAGGA 
AGTCACCGCTGTGGGGAGGCAGGAAGCTATAGCGCGGAAATGGAGAGTAAGGCAGTCTCT
AGGCATCAGGGCAAGACTTCCACAGTGCTGGCCCCAGAAGACAAGGCGAGGGCCCAGAAG 
TGGGTAAGAGGGAAACAGGAGTCAGAACCTGGTGGCCCGCCTTCACCCGGGGCAGCCACT 
CAGGAGGAGACTCATGCAAGTGGAGAGAAACATCTGGGGGTGGAGATCGAAAAGACCTCC 
GGGGAGATTGTCAGATGTGAGAAGTGTAAGAGAGAAAGAGAGCTGCAGTTGGGCCTGCAG 
AGGGAGCCGTGCCCGCTGGGAACCAGTGAGCTGGACCTGGGGAGAGCTCAGAAGAGGGAT 
TCCGAGAAGTTGGTGAGGACCAAGAGCTGCAGGAGGCCTTCTAAGGCAAAATTTACAGAT 
GGAGAGGAAGGGTGGAAGGGTGACAGCCATCGGGGCAGTCCCAGGGACCCCCCTCAGGAA 
ATGAGGAGGCCCAACAGCAACTCAGACAAGAAAGAGATCAGAGGCTCAGAAAGTCAGGAC 
AGTTATCCTCAGGGGGCACCCAAGGCCCAGAAGGACTTCGTGGAAGGGCCACCAGCTGTA
GAGGAGGGGCCGATAGACATGAGGAGAGAGGACCGGCACACATGCAGGAGCAAGCATGCG 
GGGTGGCTCCGGAGAGAGCAGGAGGCCGAACCCCCAGAGGTCGCGAGAACCCGAGGGGAG 
GAGAAGGAAGCAGAGCACGAGAAGAAGGGAGGCGGGTTAGGAGAGAGGAGGGCGCGAGAG 
AAGGAGTCTAAGAGGAAGCTAGAAGAGAAGAGGCCAGAACGACCCAGTGGCCGGAAGCCG 
AGGGCCAAGGGGATGATGTCAGCGGATGTGGAGAAGCAGTATGAGATAGGTGGGGTGATT 
GGGGATGGGAAGTTTGGCACGGTGAAGGAATGCAGGCAGGGAGAGAGCAAGGAGGCTTAG 
GGGATGAAGATGATTGACAAGTGCCAGCTGAAGGGTAAGGAGGAGATTGTCGACAGTGAG 
ATTTTAATCATCCAGAGTCTCTGTGATCCCAACATTGTGAAACTGGATGAGGTGTACGAA 
AGGGAGGCGGAGATCTACCTGATCATGGAGTATGTGCAGGGAGGGGACCTTTTTGATGCG 
ATCGTTGAAAATGTGAAGTTTCCAGAGCCCGAGGCTGCAGTTATGATCACAGACTTGTGT 
AAGGCCTTGGTCCACATGCACGACAAGAATATCGTCCACCGGGACGTGAAACCAGAAAAG 
CTCCTGGTTCAGCGAAATGAAGACAAGTCTATCACCTTGAAGCTGGCTGATTTTGGCTTG 
GCCAAATATGTGGTGAGGCCTATATTTACTGTGTGTGGGACGCCAACATATGTAGCTCCT 
GAAATTCTTTCTGAGAAAGGTTACGGCCTGGAGGTGGACATGTGGGCGGCAGGTGTGATC 
GTATACATCCTCTTGTGTGGCTTCCCCCCTTTCGGAAGTCCTGAGAGGGACCAAGACGAG 
GTCTTCAACATCATCCAAGTGGGCCAGTTTGAGTTCCTCTCTCCTTACTGGGACAACATT 
TGTGATGCTGCGAAAGATGTGGTGAGAAATTTGGTGGAGGTGGAGGGTAAGAAGCGGTAG 
AGGGCCGAACAGGTCCTACAGCATCCCTGGATTGAGATGGTTGGGCATACCAACACAGGG
AACTCACAGAAGGAGGAGTCGCCCAAGAGTTTAGGTCACTTCCAGAGTGAGCACAAGAAG 
GTTGCAGAGCAGATGGGATAA 

SEQ ID NO: 29_DRAK2_H

CTCCGCTGCTGTCGCCAGGAGTCACTTCACGAGAAGCCAGGTGAGAACCGTCGGCGCTTG 

TCTGGAAAAGTAAAAGTGGATCCTGCCACGTTCGGAGCTCCCTGGGGCCTCGCCCGGGTG 

GAGCTAGAGAACTCGTCCTGTGGCGGCCCCCGGCGTGGGGCGGGACAGCGGCCCGGTGGA 

GGGGGCAGTCCCGGGAGAACCTGCGGCGGCCGGAGCGGTAAAAATAAGTGACTAAAGAAG 

GAGACGTGGGAATCACCTAACATGTCGAGGAGGAGATTTGATTGCGGAAGTATTTCAGGG 

GTAGTAACTACAACTCCTCAAATTCCAATAAAAATGGAAAACTTTAATAATTTCTATATA 

CTTACATCTAAAGAGCTAGGGAGAGGAAAATTTGCTGTGGTTAGACAATGTATATCAAAA 

TGTACTGGCCAAGAATATGCTGCAAAATTTCTAAAAAAGAGAAGAAGAGGACAGGATTGT 

GGGGCAGAAATTTTACACGAGATTGCTGTGCTTGAATTGGCAAAGTCTTGTCCCCGTGTT 

ATTAATCTTCATGAGGTGTATGAAAATACAAGTGAAATCATTTTGATATTGGAATATGCT 

GCAGGTGGAGAAATTTTCAGCCTGTGTTTACCTGAGTTGGCTGAAATGGTTTCTGAAAAT 

GATGTTATCAGACTCATTAAACAAATACTTGAAGGAGTTTATTATCTACATCAGAATAAC 

ATTGTAGAGCTTGATTTAAAGCCACAGAATATATTAGTGAGCAGGATATACCCTCTCGGG 

GACATTAAAATAGTAGATTTTGGAATGTCTCGAAAAATAGGGCATGCGTGTGAACTTCGG 

GAAATCATGGGAACACCAGAATATTTAGCTCCAGAAATCCTGAACTATGATCCCATTACC 

ACAGCAACAGATATGTGGAATATTGGTATAATAGCATATATGTTGTTAACTCACACATCA 

CCATTTGTGGGAGAAGATAATCAAGAAACATACCTCAATATTTCTCAAGTTAATGTAGAT 

TATTCGGAAGAAACTTTTTCATCAGTTTCACAGCTGGCCACAGACTTTATTCAGAGCCT7 1
TTAGTAAAAAATCCAGAGAAAAGACCAACAGCAGAGATATGCCTTTCTCATTCTTGGCT/,

CAGCAGTGGGACTTTGAAAACTTGTTTCACCCTGAAGAAACTTCCAGTTCCTCTCAAACT 
CAGGATCATTCTGTAAGGTCCTCTGAAGACAAGACTTCTAAATCCTCCTGTAATGGAA.ee 
TGTGGTGATAGAGAAGACAAAGAGAATATCCCAGAGGATAGCAGCATGGTTTCCAAAAGA 
TTTCGTTTCGATGACTCATTACCCAATCCCCATGAACTTGTTTCAGATTTGCTCTGTTAG 
CACTTTTTTCTTTGACTCATTTGGACTGAATTTGAAATTTTATATCCACTCCAGTGAGAT 
TATGATTTGTAGCTTCATATATGACATGTTTATATTGTAAATGCACTTTTCCATGGAATA 
ATTTAGGGAAGTGTTTTAATGTTAAATTACTAGTTGCTAGCATGTTATGATTTCATATCC 
TGAGATAGCTCTGCAGATAAGAAAATATTTAAATATATGACAAAAAGTAAAATTGTACAT 

GTGAAAG 

SEQ ID NO: 30_W44160_M DRAK2_M

CCAGACGCGGCTGCACTTTTCAAACCTCAACTGTAAGAAGCGTCGGTCAGCGTCTGTGCG 
GTCGCCGCCGGGAGTCGCCTCACAGGGGCCTGGCTGACGGCGACCAGCCGTTGTGGGGAA 
GAGTGCGAGGTAAAAGTCTGCCTAGAGAAGCAGGTCTGGCAGTCATCAACATGTCTCGGA 
GGAGATTCGATTGCCGAAGTGTCTCAGGCTTGCTAACTACAACCCCTCAAACGCCGATTA 
AAACAGAGAATTTTAATAATTTCTATACTCTTACCCCAAAAGAACTTGGGAGAGGAAAAT 
TTGCTGTGGTTAGACAATGTATATCAAAATCAACTGGACAAGAGTATGCTGCCAAATCCC 
TGAAAAAGAGGAGAAGAGGGCAGGATTGCCGGGCGGAAATTCTGCATGAGATAGCTGTGC 
TGGAGCTGGCCAGGTCTTGTCCCCACGTGATTAATCTGCATGAGGTCTACGAAAATGCAA 
CGGAAATCATTTTGGTGTTAGAATATGCTGCGGGTGGAGAAATTTTCAACCTGTGTTTAC 
CTGAGTTAGCCGAAATGGTATCTGAAAATGATGTTATCAGACTCATTAAACAAATCCTTG 
AAGGAGTTCATTATCTACATCAGAATAACATTGTTCACCTTGATTTAAAGCCACAGAATA 
TACTTTTGAGCAGTATATACCCACTCGGGGACATAAAAATTGTAGATTTTGGAATGTCTC 
GAAAAATTGGGAATGCAAGTGAGCTTCGGGAAATCATGGGAACACCTGAATACTTAGCTC 
CAGAAATCCTCAACTATGATCCCATTACCACAGCAACAGATATGTGGAATATTGGCATAA 
TAGCGTATATGTTGTTAACTCATACATCACCATTTGTAGGAGAAGATAATCAAGAAACAT 
ATCTGAATATTTCTCAAGTGAATGTAGATTATTCAGAAGAAATGTTTTCATCAGTTTCAC 
AGCTGGCCACAGACTTCATCCAGAGCCTTCTAGTAAAGAACCCAGAGAAAAGACCAACAG 
CAGAATCCTGCCTATCCCACTCATGGCTGCAGCAGTGGGACTTTGGAAGCTTGTTTCATC 
CTGAGGAAACTTCAGGCTCCTCTCAAATTCAGGATCTGACTCTCAGGTCCTCTGAAGAGA 
AGACCTCCAAGTCCTCCTGTAATGGGAGCTGTGGAGCCCGGGAGGACAAGGAGAACATCC 
CTGAAGATGGCAGCTTAGTTTCTAAAAGATTTCGATTCGATGACTCCTTGCCCAGCCCCC 
ATGAACTTGTTCCAGATTTGTTCTGTTAGCATTTTTCTCTGTGACTCATCTGGACTGACT 
CGGAAATTTGAAATCTCTGGTGTGAGATTGTGTTTGTAGCTTCATATATTATGTTTATAT 
TATAAATGCACTTCTGCTTAGAAGAACTTAAGGAACAGTTTAAATGCTAGGCTTCTGTTG 
GCTAGCATATCATTTCTTGTCCTGAAATTGTTTTGCAGAGGAAAATATTTAAGTATATGA 
CAAAAAATGTAAATTGTGTTTAAGAGAACACATGCAACTGAAAGAACTCAAGTTCAGTCA 
GACTTATAAAATGGGTTATATTATGGTTAGTAAAAGTTGAAAAAAAATGAAAACAGGAAT 
TTAGTAGGTTCTAAGGTAAGCCCTATACCATAACTCTATTACAGAGAATCTGTTTGGGGA 
AATGCTGTCAAGGGTAAACCACAACATATACTGCTTTATAAATACTCCAGAGAGAGTTTA 
TAGTTGAAAGTATTTCCCAGTTACCAATAATAGCTTGAAACTGTAAGATTTTCTTTGTGT 
GCCATGTGCTCGGTGAGAGGACACAGTCAACCAGAGCAGGGTTGATCCAGGCTGTTTCTC 
TGCAAACCGAGTCAAAACTCGACATCATTTCCAGCTCATGTATTTTGTACGTGCATCATA 
TATCAGATCTAATAAGATCTGGAAGATGGATATGCAAATAAGAGGCCTTTGTCTTCTAGA 
ATGATTAGAGTAGAGGAGAATTGGATAGTACAGAATATGCTCTAGTTTCAGTCAGACATA 
TTCATAAAGGGAAATGTTAAGTTCTGGCAGCTGACTTAGTGTTGGATGTCTCCTAAGTCT 
CAGGATAGAAGCCCATCATTAGAGCATAGGCACTTCAGGAATTCTTGTGTGAAATTCTAG 

TGAGTGAGGAGGTGTGACATGCAGCTATCTTTGGGCTCCTTTTGTGTGTGTTCTGCTGGA 
CACAACACATGGGAGTGTTCAGTGTTGTCCGTGGTCAATATCTATGTTCAGTCCTGATGG 
FIGURE 2

GAGGGGCCTAGGGACT3CTTTGGAGATTTCCCACTGGTGTCCA 1'TTTAAGGTCTGTAATA 
ATGTCATGTTAAGATAACAGATCTCATAAATATGCTACTCTATCAGACTCCGTTGCCAAA 
ACAAATTAAAAGCGTGTGTATTGAAGTGGGTGTTAGTCTAAGAACGTGTAAATTCTTGAA 
ATTGTTAGTAAAATTGCAAATTCTTTAGATAAGTTTAAAGTATTTAAATTGAGCATTGCT' 
GTCTTTGTTTGATTAAAGGTTGAGTTCGTTTATATCTGTTATTTTTAAAGGAAAAGTTGl' 
TTGCGTTTTGTATATGTGTGTGGATATGTGTATGTGTACAGGTATATGTATATATGTATT 
GATAGATAAAATAGAGGCTTTAAAGAACTTG 

SEQ ID NO: 31_H01248_H, DRAK1_K

ATGATCCGTTTGGAGAAGCGAGGCAGGGGCGGCTGGTGGGCAGGCGCGACCTGAGGCTGG 
GGCCGGGCAGGCCGGGGTCTGAGCGGGCCGTGCCGGCCGCCGCCGCCGCCCCAGGCCCGC 
GGGCTGCTGACAGAGATAGGCGCCGTGGTGCGCACCGAGCCCTTCCAGGACGGCTACAGC 
CTGTGCCCGGGCCGGGAGCTGGGCAGGGGGAAATTTGCAGTGGTGAGAAAATGTATAAAG 
AAAGATTGTGGGAAAGAATTTGCTGCAAAGTTCATGAGAAAAAGAAGAAAAGGGCAAGAT 
TGTGGGATGGAAATAATTCATGAGATTGCTGTAGTTGAAGTAGCACAAGACAATCCTTGG 
GTGATTAATTTACATGAAGTTTATGAGAGTGGATCAGAAATGATCTTAGTTCTGGAATAT 
GGTGGTGGGGGTGAAATGTTTGAGCAGTGTGTTGCAGACAGAGAAGAAGGCTTTAAAGAA 
AAAGATGTTGAAAGAGTTATGGGACAGATTTTAGAAGGTGTTCACTTTTTACAGACTCGT 
GATGTAGTTCATGTTGATTTGAAGCCTCAGAATATTCTGTTGAGAAGTGAATCTCCATTG 
GGTGAGATTAAGATTGTTGATTTTGGGGTTTCAAGAATATTGAAGAACAGTGAAGAGGTC 
GGAGAAATTATGGGTAGCGCTGAATATGTGGCTGGTGAAATTGTTAGTTATGATGGTATA
AGCATGGGAAGAGATATGTGGAGGATTGGAGTGTTAACATATGTCATGGTTACAGGAATA 
TCAGGTTTGTTAGGCAATGATAAACAAGAAACATTCTTAAACATCTCAGAGATGAATTTA 
AGTTATTGTGAGGAAGAATTTGATGTTTTGTGTGAGTGGGCTGTTGATTTCATCAGGACA 
CTTTTAGTTAAGAAACGTGAAGATCGAGGGACTGGTGAAGAATGTCTAAAGCACCCCTGG 
TTGACACAGAGCAGTATTCAAGAGCGTTCTTTCAGGATGGAAAAGGCACTAGAAGAAGCA 
AATGGGGTCGAAGAAGGTGATTCTGTGCCTGAAATTAATTCGGATACGGAGAAATCAGAA 
ACCGAGGAATCCATTGTAACCGAAGAGTTAATTGTAGTTACTTCATATAGTGTAGGACAA
TGGAGACAGTGTGAAAAAGAGAAAATGGAGCAAAAGGGCATTTCCAAAGGATTTAAATTT 
GAGGAAGCTTTGCTAGAAGAAATTGGAGGAGAATTTATGTAGTGA 

SEQ ID NO: 32_AA021445_H

CGGGGCTGCCGGGGCCGGGACTGGGGGAGCCGGGCCCGCGGGCCGCCTGCTGCCTCCGCC 
CGCGCCGGGGTCCCCAGCCGCCCCCGCTGCCGTGTCCCCTGCGGCCGGCCAGCCGCGTCC 
CGCAGCCCCGGCCTCCCGCGGACCCATGCCCGCCCGTATCGGCTAGTACGAGATCGAGCG 
CACCATGGGCAAGGGCAAGTTGGGGGTGGTCAAGCGGGCGACGCACCTCGTCAGCAAGGG 
CAAGGTTGCTATCAAGATGATAGATAAGACCCAGGTGGATGAAGAAAACTTGAAGAAGAT 
TTTGCGGGAAGTTCAAATTATGAAGATGGTTTGCCACCCGCATATCATCAGGCTCTACCA 
GGTTATGGAGACAGAAGGGATGATTTATCTGGTGACAGAATATGCTAGTGGAGGGGAAAT 
ATTTGAGGAGGTGGTGGGGGATGGTAGAATGGCAGAAAAGGAGGGACGTCGGAAGTTCAA 
AGAGATGGTCAGAGCTGTGTATTTTTGTGAGTGTGGGAACATTGTTCATCGTGATTTAAA 
AGCTGAAAATTTACTTCTGGATGCCAATCTGAATATCAAAATAGCAGATTTTGGTTTCAG 
TAACCTGTTCACTGGTGGGCAGCTGGTGAAGAGGTGGTGTGGCAGCGCTCGGTATGGTGC 
AGGTGAAGTGTTTGAAGGAAAAGAATATGATGGGGGCAAAGTGGACATGTGGAGCCTTGG 
AGTTGTGCTGTACGTGGTTGTGTGCGGTGGCCTGCGATTTGATGGAAGGACAGTGCAGAA 
TGTGCGGGGCCGCGTGGTGAGTGGAAAGTTCGGCATCCCATTTTTTATGTCCACAGAATG 
TGAGCATTTGATCGGCGATATGTTGGTGTTAGATCCGAATAAGGGCGTCTCCATGGAGCA 
GATCTGCAAGCACAAGTGGATGAAGCTAGGGGACGCCGATCCCAACTTTGACAGGTTAAT 
AGCTGAATGCCAACAACTAAAGGAAGAAAGACAGGTGGACCCCCTGAATGAGGATGTCC'I- 
CTTGGCCATGGAGGACATGGGACTGGACAAAGAACAGACACTGCAGTCATTAAGATCAGA 
TGCCTATGATCACTATAGTGCAATCTACAGCCTGCTGTGTGATCGACATAAGAGACATAA
AACCCTGCGTCTCGGAGCACTTCCTAGCATGCCCCGAGCCCTGGCCTTTCAAGCACCAGT 
CAATATCCAGGCGGAGCAGGCAGGTACTGCTATGAACATCAGCGTTCCCCAGGTGCAGCT 
GATCAACCCAGAGAACCAAATTGTGGAGCCGGATGGGACACTGAATTTGGACAGTGATGA 
GGGTGAAGAGCCTTCCCCTGAAGCATTGGTGCGCTATTTGTCAATGAGGAGGCACACAGT 
GGGTGTGGCTGACCCACGCACGGAAGTTATGGAAGATCTGCAGAAGCTCCTACCTGGCTT 
TCCTGGAGTCAACCCCCAGGCTCCATTCCTGCAGGTGGCCCCTAATGTGAACTTCATGCA 
CAACCTGTTGCCTATGCAAAACTTGCAACCAACCGGGCAACTTGAGTACAAGGAGCAGTC 
TCTCCTACAGCCGCCCACGCTACAGCTGTTGAATGGAATGGGCCCCCTTGGCCGGAGGGC 
ATCAGATGGAGGAGCCAACATCCAACTGCATGCCCAGCAGCTGCTGAAGCGCCCACGGGG 
ACCCTCTCCGCTTGTCACCATGACACCAGCAGTGCCAGCAGTTACCCCTGTGGACGAGGA 
GAGCTCAGACGGGGAGCCAGACCAGGAAGCTGTGCAGAGGTACTTGGCAAATAGGTCCAA 
AAGACATACACTGGCCATGACCAACCCTACAGCTGAGATCCCACCGGACCTACAACGGCA 
GCTAGGACAGCAGCCTTTCCGTTCCCGGGTCTGGCCTCCTCACCTGGTACCTGATCAGCA 
TCGCTCTACCTACAAGGACTCCAACACTCTGCACCTCCCTACGGAGCGTTTCTCCCCTGT 
GCGCCGGTTCTCAGATGGGGCTGCGAGCATCCAGGCCTTCAAAGCTCACCTGGAAAAAAT 
GGGCAACAACAGCAGCATCAAACAGCTGCAGCAGGAGTGTGAGCAGCTGCAGAAGATGTA 
CGGGGGGCAGATTGATGAAAGAACCCTGGAGAAGACCCAGCAGCAGCATATGTTATACCA 
GCAGGAGCAGCACCATCAAATTCTCCAGCAACAAATTCAAGACTCTATCTGTCCTCCTCA 
GCCATCTCCACCTCTTCAGGCTGCATGTGAAAATCAGCCAGCCCTCCTTACCCATCAGCT 
CCAGAGGTTAAGGATTCAGCCTTCAAGCCCACCCCCCAACCACCCCAACAACCATCTCTT 
CAGGCAGCCCAGTAATAGTCCTCCCCCCATGAGCAGTGCCATGATCCAGCCTCACGGGGC 
TGCATCTTCTTCCCAGTTTCAAGGCTTACCTTCCCGCAGTGCAATCTTTCAGCAGCAACC 
TGAGAACTGTTCCTCTCCTCCCAACGTGGCACTAACCTGCTTGGGTATGCAGCAGCCTGC 
TCAGTCACAGCAGGTCACCATCCAAGTCCAAGAGCCTGTTGACATGCTCAGCAACATGCC 
AGGCACAGCTGCAGGCTCCAGTGGGCGCGGCATCTCCATCAGCCCCAGTGCTGGTCAGAT 
GCAGATGCAGCACCGTACCAACCTGATGGCCACCCTCAGCTATGGGCACCGTCCCTTGTC 
CAAGCAGCTGAGTGCTGACAGTGCAGAGGCTCACAGCTTGAACGTGAATCGGTTCTCCCC 
TGCTAACTACGACCAGGCGCATTTACACCCCCATCTGTTTTCGGACCAGTCCCGGGGTTC 
CCCCAGCAGCTACAGCCCTTCAACAGGAGTGGGGTTCTCTCCAACCCAAGCCCTGAAAGT 
CCCTCCACTTGACCAATTCCCCACCTTCCCTCCCAGTGCACATCAGCAGCCGCCACACTA 
TACCACGTCGGCACTACAGCAGGCCCTGCTGTCTCCCACGCCGCCAGACTATACAAGACA 
CCAGCAGGTACCCCACATCCTTCAAGGACTGCTTTCTCCCCGGCATTCGCTCACCGGCCA 
CTCGGACATCCGGCTGCCCCCAACAGAGTTTGCACAGCTCATTAAAAGGCAGCAGCAACA 
ACGGCAGCAGCAGCAGCAACAGCAGCAACAGCAAGAATACCAGGAACTGTTCAGGCACAT 
GAACCAAGGGGATGCGGGGAGTCTGGCTCCCAGCCTTGGGGGACAGAGCATGACAGAGCG 
CCAGGCTTTATCTTATCAAAATGCTGACTCTTATCACCATCACACCAGCCCCCAGCATCT 
GCTACAAATCAGGGCACAAGAATGTGTCTCACAGGCTTCCTCACCCACCCCGCCCCACGG 
GTATGCTCACCAGCCGGCACTGATGCATTCAGAGAGCATGGAGGAGGACTGCTCGTGTGA 
GGGGGCCAAGGATGGCTTCCAAGACAGTAAGAGTTCAAGTACATTGACCAAAGGTTGCCA 
TGACAGCCCTCTGCTCTTGAGTACCGGTGGACCTGGGGACCCTGAATCTTTGCTAGGAAC 
TGTGAGTCATGCCCAAGAATTGGGGATACATCCCTATGGTCATCAGCCAACTGCTGCATT 
CAGTAAAAATAAGGTGCCCAGCAGAGAGCCTGTCATAGGGAACTGCATGGATAGAAGTTC 
TCCAGGACAAGCAGTGGAGCTGCCGGATCACAATGGGCTCGGGTACCCAGCACGCCCCTC 
CGTCCATGAGCACCACAGGCCCCGGGCCCTCCAGAGACACCACACGATCCAGAACAGCGA 
CGATGCTTATGTACAGCTGGATAACTTGCCAGGAATGAGTCTCGTGGCTGGGAAAGCACT 
TAGCTCTGCCCGGATGTCGGATGCAGTTCTCAGTCAGTCTTCGCTCATGGGCAGCCAGCA 
GTTTCAGGATGGGGAAAATGAGGAATGTGGGGCAAGCCTGGGAGGTCATGAGCACCCAGA 
CCTGAGTGATGGCAGCCAGCATTTAAACTCCTCTTGCTATCCATCTACGTGTATTACAGA 
CATTCTGCTCAGCTACAAGCACCCCGAAGTCTCCTTCAGCATGGAGCAGGCAGGCGTGTA 
ACAAGAAAGAGAGAGTTTTGTGTAGAG0TTGG3AATGAAAAGGTTGATTGTAAACCCACA

GTATCTAGCAGCGTTGTGCCAAATTGCCCTTGTGTTTCTCTCCACCCAAAATATCACAG^ 
TGGTTTGGTGAGATTTGGTTGATGCGTGTGCTGTTCTTTTGGGTTCTGAGAGGGTTTTGG 
CATGTTTGCTTGTATGACCAAGTCACCAAGGAAATAAACAGGAAGGAAATCCATGTTCTC 



SEQ ID NO: 33_2R22-5-Il_H 

GTGGGGCGCTGGCGGTCAGGTCGGCCGCGGCTGAGAGCTCGGGGAGGGTGAAGGGGGACA 

GGGGGCGGTCACGTGGGGAGATCGACACACCGACGGCTGGTGGTGCAGAGGCAAGAACCG 

AGCGGGCCTAGGGTGGCGGCGTGCAGCGAAGCCCAAGAGCTGGCGTGGCCACGAAGGTTG 

AACCAGGCAAATTTTCGAGACAGGTCAGGGCTTAGAGGAAGGTTGATCTAAATAAAGGCC 

GGGTAAAGTGACATTGCAGGGATTAAATCCTTGTTTGGCTGCGTGTGTGAGCAGAAGGCT 

TATTTGCAAGTTTGTTGTTTGGTGGGGTGGAGATTATTAGGTGTGGAGCGCGGTGCAGCT 

TGAGAGAAAGAGAAGCATGAAATGAAGGTCAGAGATGAGATCCCGCAGGAGGGAGGTGGG 

GGGGTGCGAGGGGCATTTAGGGAGGAGAGTGCAAGATTGTGTGGGCATCAAGGGAAATAG 

CAAAGAGAAGGCTTTGTGGTGGGGGAGAGGGAGCTACGACAAAGCATGAGAGTGCACGTG 

TGGCCAGAAAGTTGGTGGAGTGGCATCAGGGGAGTGGGTATGTAACATGTGGGTAATTGT 

ACAGGTAGAGCGTGCAAGTTCAAGGTGAGGGAAGGTGGGAAATGTGTTGAGTGAGGCGAG 

GAGGTGGTGGGTGGGGTGGGGAGAGTCAGGTAGCAGGTTCAGTGCGTTGGTCTCACTAAA 

GCGGAGAGGGAGGCTGGTCAGGTGTCAGGAAAAGTGTTTTGAAGCGTGGGCAGCTGGTGT 

GGTGAGTTGGGATGTGCGAGGGTGTGAGGGTCTTGTGGTGGTGCAGAAGGTGCGTCTTGG 

GTGAGATGGAGACGTGAGGGCGCGTGGAGGATGAGTGCAGTGTATATGAATGGAGGTGGG 

GTGGTGAACCCCCAGTATGCCCGGTGGGATGGGCGGGACAGTGTAGAAAGTGGCTGTCAG 

AGGGAGAGTAGGAAGGAGGGTGAGGAGGGACAGGCGGGCCAGGTGAGGGGCTTCGAGAAA 

GTGAGACAGGACATGTCGGAGGATGAGAAGGTGGTGAGGGAGATCAGGGTGGGGAAAGGG 

ATAGGCTTGTAGGGAATTGGAGGGGAAATCGGAAGTGGAAAGTTGTCGCAAGTGAAGGTT 

GGGATTCACTCGCTAACGAAAGAAAAGGTGGGCATTAAGATCCTGGACAAGACCAAGTTA

GACCAGAAAACCCAGAGGCTACTATCCCGAGAAATCTCCAGCATGGAAAAGCTGCACCAT

GCCAACATCATCCGCCTTTACGAAGTGGTGGAGACCCTATCCAAGCTGCAGTTGGTGATG 

GAGTATGCAGGGGGTGGGGAGGTGTTGGGAAAAATTAGCACTGAGGGGAAGCTCTCTGAA 

CCAGAAAGCAAGGTGATGTTCTCGCAGATTGTGTGTGGCGTGAAGGACATGGATGAAAAG 

CAAATTATTCATAGAGATGTGAAAGCAGAAAATGTATTCTATACCAGTAATACTTGTGTG 

AAGGTGGGCGATTTTGGATTGAGGACAGTAAGCAAAAAAGGTGAAATGCTGAACACTTTG 

TGTGGGTGTGCTCGGTAGGGTGGGGGTGAAGTCTTGCGGGACGAGCAGTACATCGGCATT 

TACGTGGATATCTGGGCCTTGGGGGTGGTTTTGTAGTTGATGGTGACTGGCACCATGCCA 

TTTCGGGGAGAAACCGTGGCCAAAGTAAAAAAGAGGATGGTCGAGGGCACATAGAGTGTA 

GCGGGGCACGTGTCAGAGGGCTGGCACCGAGTGATCGGAGGAGTGGTTCAGCAGATCCGC 

AGGGAGAGGTAGGGAATCGAGTGGATCATGAATGATGAATGGATGCAAGGGGTGCCATAC 

GGTACACGTTTGGAACGTTTGCAAGTGGATGGGAAAGATTTGTGGGAAAGGAGGAGTGTG 

AAGGAAGAAGAAAATGAGGTCAAAAGCAGTTTAGAAGATTTGGGGATTAGAGAAGAGCAT 

ATTGGAAATAACGAAGGGAGAGATGCTCGCAGGTCAATGACAGGGGTGTATAGAATTATT 

TTAGATAGAGTCCAAAGGAAGAAGGGTTTGGAAAGTGTGGCAGTCATGATGCTAGCAGAC 

GGTAAAGAAAGAGAGGTCAAAAAAGGGTCGGGTGTGTACAGAGGGATAAGACACACATCG 

AAATTTTGCTCGATTTTATAAATTGCACTAGACTGCTTGTAACTAACCAAGATGATTGTT 

GGTGGTTGTAAATTTTTTTCAAGGAGAAGTTGAGTGGAGAGATTTTTGTAATTTTTAAAT 
AAACTTAAATTTGAGATATGGAAAAAAAAAA 

SEQ ID NO: 34_R31237_1_H, AAC33487

ATGTCCACTAGGACCCCATTGCCAACGGTGAATGAACGAGACACTGAAAACCACACGTCA 
GATGGAGATGGGGGTCAAGAAGTTAGGTGTGGTACGAGGGGGTCAGGAGGTGGGTGTAGA 
AACTCTATAGCCTCCTGTGCAGATGAACAACCTCACATCGGAAACTACAGACTGTTGAAA

ACAATCGGCAAGGGGAATTTTGCAAAAGTAAAATTGGCAAGACATATCCTTACAGGCAGA 

GAGGTTGCAATAAAAATAATTGACAAAACTCAGTTGAATCCAACAAGTCTACAAAAGCTC 

TTCAGAGAAGTAAGAATAATGAAGATTTTAAATCATCCCAATATAGTGAAGTTATTCGAA 

GTCATTGAAACTGAAAAAACACTCTACCTAATCATGGAATATGCAAGTGGAGGTGAAGTA 

TTTGACTATTTGGTTGCACATGGCAGGATGAAGGAAAAAGAAGCA/^GATCTAAATTTAGA 

CAGATTGTGTCTGCAGTTCAATACTGCCATCAGAAACGGATCGTACATCGAGACCTCAAG 

GCTGAAAATCTATTGTTAGATGCCGATATGAACATTAAAATAGGAGATTTCGGTTTTAGG 

AATGAATTTACTGTTGGCGGTAAACTCGACACGTTTTGTGGCAGTCCTGCATAGGCAGCA 

CCTGAGCTCTTCCAGGGCAAGAAATATGACGGGCCAGAAGTGGATGTGTGGAGTCTGGGG 

GTCATTTTATACACAGTAGTCAGTGGGTCACTTCCCTTTGATGGGCAAAACGTAAAGGAA 

CTGAGAGAGAGAGTATTAAGAGGGAAATACAGAATTCCCTTCTACATGTCTACAGACTGT 

GAAAACCTTCTCAAACGTTTCCTGGTGCTAAATCCAATTAAACGCGGCACTCTAGAGCAA 

ATCATGAAGGACAGGTGGATCAATGCAGGGCATGAAGAAGATGAACTCAAACCATTTGTT 

GAACCAGAGCTAGACATCTCAGACCAAAAAAGAATAGATATTATGGTGGGAATGGGATAT 

TCACAAGAAGAAATTCAAGAATCTCTTAGTAAGATGAAATACGATGAAATCACAGCTACA 

TATTTGTTATTGGGGAGAAAATCTTCAGAGCTGGATGGTAGTGATTCCAGTTCTAGCAGC 

AATCTTTCACTTGCTAAGGTTAGGCCGAGCAGTGATCTCAACAACAGTACTGGCCAGTGT 

CCTCACCAGAAAGTGCAGAGAAGTGTTTCTTCAAGCC7UVAAGCAAAGACGCTACAGTGAC 

CATGCTGGACCAGCTATTCCTTCTGTTGTGGCGTATCCGAAAAGGAGTCAGACAAGCACT 

GCAGATGGTGACCTCAAAGAAGATGGAATTTCCTCCCGGAAATCAAGTGGCAGTGGTGTT 

GGAGGAAAGGGAATTGCTCCAGCCAGTCCCATGCTTGGGAATGCAAGTAATCCTAATAAG 

GCGGATATTCCTGAACGCAAG/^AAAGCTCCACTGTCCCTAGTAGTAACACAGCATGTGGT 

GGAATGACAGGACGAAATACTTATGTTTGCAGTGAGAGAACTACAGCTGATAGACAGTCA 

GTGATTCAGAATGGCAAAGAAAACAGCACTATTCCTGATGAGAGAACTGCAGTTGCTTCA 

ACACACAGTATCAGTAGTGCAGCCACGCCAGATCGAATGCGCTTCGCAAGAGGGAGTGGG 

AGTCGTAGCACTTTCCACGGCCAGCCCCGGGAACGGCGAACCGCAACATATAATGGCCCT 

CCTGCCTCTCCCAGCCTGTCCCATGAAGCCACAGCATTGTCCCAGACTCGAAGCCGAGGG 

TCCACTAATCTCTTTAGTAAATTAACTTCAAAACTCACAAGGAGTCGCAATGTATCTGCT 

GAGCAAAAAGATGAAAACAAAGAAGCAAAGCCTCGATGCCTACGGTTCACCTGGAGCATG 

AAAACCACTAGTTCAATGGATCCGGGGGACATGATGCGGGAAATGCGCAAAGTGTTGGAC 

GCCAATAACTGCGACTATGAGCAGAGGGAGCGCTTCTTGCTCTTCTGGGTCCACGGAGAT 

GGGCACGCGGAGAACCTCGTGCAGTGGGAAATGGAAGTGTGCAAGCTGCCAAGACTGTCT 

CTGAACGGGGTCCGGTTTAAGCGGATATCGGGGACATGCATAGCCTTCAAAAATATTGGT 

TCCAAAATTGCCAATGAGCTAAAGCTGTAA 

SEQ ID NO: 35_W90839_M

AAAGGGCCGTCCTGGTCCAGCCGTTCCCTGGGTGCCCGTTGCCGGAACTCTATCGCTTCC 

TGCCCTGAGGAACAACCCCATGTGGGCAACTATAGGCTGCTAAGGACCATCGGGAAGGGC 

AACTTCGCCAAAGTCAAGCTGGCTCGGCATATCGTCAGGGGGCGGGAGGTCGGTATTAAG 

ATCATTGATAAGACGCAGCTGAACCCGAGTAGGTTGCAGAAGCTGTTCAGAGAAGTCCGA 

ATTATGAAGGGACTCAACGACCCCAACATCGTGAAGCTTTTTGAGGTGATAGAGACGGAG 

AAGAGGCTATACCTGGTGATGGAATAGGGTAGCGCAGGAGAAGTGTTTGACTACCTCGTG 

TCGGACGGCCGCATGAAGGAGAAGGAGGGTCGAGCCAAGTTGCGGGAGATGGTGTGAGCC 

GTGCACTACTGTCATCAGAAGAACATTGTAGACAGGGATGTAAAGGGTGAAAAGGTGTTG 

CTGGATGCCGAGGCCAAGATCAAAATCGGCGACTTCGGGTTCAGGAATGAGTTCAGGGTG 

GGCTCCAAGCTGGACACCTTCTGTGGGAGCCCCCCATACGCCGCCCCAGAGCTGTTCCAG 

GGCAAGAAGTATGATGGGGCAGAGGTGGACATGTGGAGGCTGGGTGTCATGGTGTACACG 

CTGGTCAGCGGCTCCCTGCCCTTCGATGGGCACAACCTCAAGGAGCTGCGGGAGCGAGTC 

CTCAGAGGAAAGTACCGGGTCCCCTTCTACATGTCTACAGACTGCGAGAGCATTCTGCGG 
AGATTTGTGGTGG7GAACGCG3CAAAACGCTGTACTCTG3AGGAAATCATGAAAGACAAA
TGGATCAACATCGGGTATGAGGGTGAGGAGGTGAAGCGAGACAGGGAGCTGAAAGAAGAG
CGGATGCCGGGTGGGAAAGCGAGCTGCAGTGCAGTGGGGAGTGGAAGTGGAGGGTTGCCC 
CCCTCCAGCCCCATGGTCAGCAGTGCCCACAACCCCAATAAGGCAGAGATCCCTGAGCGG 
CGGAAGGACAGCACTAGCACCCCTAACAACCTCCCCCCCAGCATGATGACCCGAAGAAAC 
ACCTATGTGTGCACAGAGCGACCAGGATCTGAACGCCCGTCCTTGTTGCCAAATGGCAAA 
GAAAATAGCTCGGGTACCTGGCGGGTGGCCCGTGCCTCGCCTTGGAGTCATAGCCTGGCT 
CCGCCGTGAGGCGAGCGGAGCCGCCTGGCTCGGGGCTCCACCATCCGCAGCACCTTCCAT 
GGGGGCGAGGTCCGAGACCGGCGGGGAGGGAGGGGGAGTGGCGGGGGTGTGGAGAATGGA 
CCCCCAGCGTCACGGACGGTTGGGCACGAGGGCGCACCGCTGCCCTCCGGGCGGGCTCGC 
CCCACGAGCAAGCTCTTGACCAAGCTGACCTCCAAACTGACCGGAAGGGTCACAGACGAA 
CGTGAGAGAATCGGGGGAGCTGAGGTCACAAGTTGCCATCTAGCTTGGGATAAAACGGAA 
AGCGCCCCCAGGCTGGTCCGATTCCCGTGGAGTGTGAAGGTGACCAGCTCGCGAGGTTCG 
TGAGGGGGTGATGGGTGCGCTGCGACAGGCCACA 

SEQ ID NO: 36_406786.5_H

GTAGCCGGCTTGGGGTGAGGGTGGGCTGATGCAGTTGTTAGAGGTGGAAGGTTGGCAGTT 
GGCGTCCGTTCTTGGCATGGAGGTCGGGGGCTTAACAGTCTTTGAAGAGGACCAGAGATG 
GCTTTGCCAGAGGGTGCGGTTGCCAGTGTGAGGAGAGGGCCCAGCTGCACAGAGGACTGC 
TGAGCCCAGGAGGTCGTTTTCCTGAGGCCACAGACAGGTGAGGAGAAGGAATGGGCTTTC 
CAGACTCTGCCAGAGCAGGACGGCGCTCTGTGAAGACAGATGGAGCTGGTATTGTCTATC 
ATCACTGGGTGCCCAGAATATTTGTACAAGTAAACTGCAGTGCCCTGCTGCCCCTGAGCA 
CACGGAGCCGTCCGAACCGCGGGGCAGTGTGTCGTGGTGGTCCCTGCTGCGGGGAGTGTC 
CTCAGGGTGGTCCTCACGTCTGCTTCCGGGCCCTGTGTGCAAGGGTAACAAGGCCATCTT 
CACGGTGGATGCCAAGACCACAGAGATCCTCGTTGGTAAGGACAAAGCTTGCGGGCTCCT 
GGGGTACAGCAGCCAGGACCTGATTGGCCAGAAGCTCACGCAGTTCTTTCTGAGGTCAGA 
TTCTGATGTGGTGGAGGCCCTCAGCGAGGAGCACATGGAGGCCGACGGCCACGCTGCGGT 
GGTGTTTGGCACGGTGGTGGACATCATCACCCGTAGTGGGGAGAAGATTCCAGTGTCTGT 
GTGGATGAAGAGGATGCGGCAGGAGCGGCGGGTATGCTGCGTGGTGGTCCTGGAGCCCGT 
GGAGAGGGTCTCGACGTGGGTCGCTTTCCAGAGCGATGGCAGCATCACGTCATGTGACAG 
TGTGTTTGGTCATOTTCAGGGGTACGTGTCTGGGGAGGAGGTGGGTGGGCAGGATATCAC 
AGACCTGATCGCTTCTGTGCAGCTCCGTGGTTCTGGCCAGCACATCCCAAAGAATCTCAA 
GATTCAGAGGTCTGTTGGAAGAGCCAGGGACGGTACCACCTTCGCTCTGAGCTTAAAGGT 
GAAATGGGAAGCCAGCAGCGAGGAGGCGAGCACGGGTGAGGCGGCCCCTGTGAGCGGCTA 
CGGGGGATCTGTCTGGGTGTTGTGCACCATCAGTGGCCTCATCACCCTCCTGGCGGATGG 
GACCATCCACGGCATCAACCACAGCTTCGCGCTGACAGTGTTTGGTTACGGAAAGAGGGA 
GCTCCTGGGCAAGAATATCACTTTGGTGATTGCTGGTTTCTAGAGCTAGATGGAGGTTGC 
GTACAACAGCTCATTACAGCTGCGAGACGTGGGGAGCTGCCTGGACGTCGGCAATGAGAG 
TGGGTGTGGGGAGAGAACGTTGGACCCGTGGGAGGGCGAGGAGCCAGCTGAGGGGGGGCA 
GGATCCAAGGATTAATGTCGTGCTTGCTGGTGGGCACGTTGTGCCCCGAGATGAGATGCG 
GAAGCTGATGGAAAGCCAAGACATCTTCACCGGGACTCAGACTGAGCTGATTGCTGGAGG 
GCAGCTCCTTTGCTGCGTGTCACCTCAGCCTGGTGCAGGGGTGGACAATGTCCCAGAAGG 
AAGCGTGGCAGTGCACGGTGAACAGGCGGTGCCCAAGGACCAGCAAATGACTGCCTTGGG 
GAGAGAGGAAGGTGTGGCAATAGAGAGCCCCGGACAGGATGTTCTGGGAGAAAGGAGGTC 
TGAAGGAGTGGATGTGAAGGCATTTGCTTCGTGGGAAGATTGTGAAGGTCCAGTGGCAGC 
TGAGGATGGGGGCAGTGATGGTGGGATGTGTGGCGTGTGTGAGAAGGGGCAGGTAGAGGG 
GATGGGAGTCAGTGGTCCCAGCGGTTCAGACCTTTGGGCTGGGGCTGCCGTGGCCAAGCC 
CGAGGGCAAGGGTGAGGTGGGGGGGGGGAGCGTCGTGATGCAGTGCCCTTGCTATGGGAG 
TGAATGGGGGTTGTGGTGGGGAAGGGAGGAGTTGGCCGCGAGGGGCTGTGGGATGGGAGG 
CCTCTGGTTTGGGACACCTAGTCTAGATGAGCCGTGGCTGGGAGTGGAAAAGGAGCGAGA 
AGAGCTGCAGACCTGCTTGATTAAGGAGCAGCTGTCCCAGTTGAGCCTTGCAGGAGCCCT

GGATGTCCCCCACGCCGAACTCGTTCCGACAGAGTGCCAGGCTGTCACCGCTCGTCTGTC 

GTCCTGCGATCTGGGAGGCAGAGACCTGTGCGGTGGCTGCACGGGCAGCTCCTCAGCCTG 

CTATGCCTTGGCCACGGACCTCCCTGGGGGCCTGGAAGCAGTGGAGGCCCAGGAGGTTGA 

TGTGAATTCGTTTTCCTGGAACCTCAAGGAACTCTTTTTCAGTGACCAGACAGACCAAAC 

GTCATCAAATTGTTCCTGTGCTACGTCTGAACTCAGAGAGACACCCTCTTCCTTGGCAGT 

GGGCTCCGATCCAGATGTAGGCAGTCTCCAGGAACAGGGGTCGTGTGTCCTGGATGACAG 

GGAGCTGTTACTACTGACCGGCACCTGTGTTGACCTTGGCCAAGGCCGACGGTTCCGGGA

GAGCTGTGTGGGACATGATCGAACAGAACCGCTTGAGGTTTGTTTGGTGTGCTCTGAGCA 

TTATGCAGCAAGCGACAGAGAAAGCCCAGGACACGTTCCTTCCACGTTGGATGCTGGCCC 

TGAGGACACGTGCCCATCAGCAGAGGAGCCAAGGCTGAACGTCCAGGTCACCTGCACGCG 

CGTGATCGTGATGCGCGGGGCTGCTGGCCTGCAGCGGGAGATCCAGGAGGGTGGCTACTG 

CGGGAGCTGCTACCATCGAGATGGCTTACGGCTGAGTATACAGTTTGAGGTGAGGGGGGT 

GGAGCTCCAGGGCCCCACACCTCTGTTCTGCTGGTGGCTGGTGAAAGACCTCCTCCACAG 

CCAACGCGACTCAGCCGCCAGGACCCGCCTGTTCCTTGCCAGCCTGGCCGGCTCCACCCA 

CTCTACCGCTGCTGAGCTCACCGGACCCAGCCTGGTGGAAGTGCTCAGAGCCAGACCCTG 

GTTTGAGGAGCCCCCCAAGGCTGTGGAACTGGAGGGGTTGGCGGCCTGTGAGGGCGAGTA 

CTCCCAAAAGTACAGTACCATGAGCCCGCTGGGCAGTGGGGCCTTCGGGTTCGTGTGGAG 

TGCTGTGGACAAGGGAAAAAACAAGGAGGTGGTGGTGAAGTTTATTAAGAAGGAGAAGGT 

CTTGGAGGATTGTTGGATTGAGGATCCCAAACTTGGGAAAGTTACTTTAGAGATCGGAAT 

TCTATCCAGGGTGGAGCACGCCAATATCATCAAGGTATTGGATATATTTGAAAACCAAGG 

GTTCTTCCAGCTTGTGATGGAGAAGCACGGCTCCGGCCTAGACCTGTTCGCTTTCATCGA 

CCGCCACCGCAGGCTGGATGAGCCCCTGGCGAGCTACATCTTCCGACAAGTGAGAGCAGG 

CCAGAGCCGTCTAGTGTCAGCAGTGGGATACCTGCGCTTGAAGGACATCATCCACCGTGA 

CATCAAGGATGAGAACATCGTGATCGCTGAGGACTTCACAATCAAGCTGATAGACTTTGG 

CTCGGCCGCCTACTTGGAAAGGGGAAAATTATTTTATACTTTTTGTGGGACCATCGAGTA 

CTGTGCACCGGAAGTTCTCATGGGGAATCCCTACAGAGGGGCGGAGCTGGAGATGTGGTC 

TCTGGGAGTCAGTGTGTACACGGTGGTCTTTGAGGAGAACCCCTTCTGTGAGGTGGAGGA 

GACCGTGGAGGCTGCCATACACCCGCCATAGGTGGTGTGCAAAGAACTCATGAGCCTTGT 

GTCTGGGCTGCTGCAGCCAGTCCGTGAGAGACGCAGCACCTTGGAGAAGGTGGTGAGAGA 

CCCGTGGGTAACACAGCCTGTGAATCTTGCTGACTATACATGGGAAGAGGTGTTTCGAGT 

AAACAAGCCAGAAAGTGGAGTTCTGTGCGCTGCGAGCCTGGAGATGGGGAAGAGGAGGGT 

GAGTGATGTGGCCCAGGCTCAGGAGCTTTGTGGGGGCCCCGTTCCAGGGGAGGGTCCTAA 

TGGCCAAGGCTGTTTGCATCCCGGGGATCCCCGTCTGCTGACCAGCTAAACACCAATTTC 

TTCCTGCTTTTCTCCACTTGGTTTGGAAAATCACACAGTTTTCAGGCTGCATCTGTTTG 

SEQ ID NO: 37_AA544838_M 406786_M

CCACGCGTGCGCATCCCTGCTTGGATGAGCCCCTGGCGAGTTTCATCTTTCGACAACTAG 
TGTCTGCTGTAGGATACCTGCACTCCCAAGGCATCATCCATAGAGACATCAAGGATGAGA 
ACATTGTGATTGCTGAGGACTTCACAATTAAGCTGATAGATTTTGGCTCAGCTGCCTACT 
TAGAGAGGGGCAAACTATTTTATACCTTTTGTGGAACAATCGAATACTGTGCACCTGAGG 
TTCTCATTGGAAATGCCTACAGAGGGCCAGAGGTGGAGATGTGGTCTCTGGGGGTCACCC 
TGTACACGCTCATCTTCGAGGAGAATCCCTTCTGTGAGGTGGAGGAGACCATGGAGGCAG 
TTATTCATCCCGCATTGCTGGTTTGCCAAGAACTTATGAGTGTTGTGTGTGGAGTGGTGC 
AGCCTTGCCCTGAGCAGCGGACCACTTTGGAGAAGCTGATCAGGGACCCCTGGGTGACAC 
AGCGTGTGAACGTTGCTAGCTATAGTTGGGAAGAGGTGTGTAGGAGGAACCAGCCAGAAA 
GTGGCCTGCTGTCAGCTGCAAGTCTGGAGATTGGGAGTAGGAGTCCAAGTGAAATGGCTC 
AGAGAGAGGGTCTCTGTGGGCCTCCTGCTCCCAGGGAGACTCGTGGTGACCAGCACTGCT
TGCATCTTAAGGACCCCTCTTTGCCAGTCAGCTGAGCAAGCTCTCCTGCTCTTTGGTTTG 
GGCAGTTGTATGGATTTCAGGGCTTTCTACCTGGAGAAAGGAAGTTGTGAAGGATTGGGA 
TGACTTCTGCTTCTAGATTCCTATGCAAATGCTACAAGAGCCTGCGATGCTAGTTTTCTT
AGGTTTATGATATAGACTTGTAATTCATGTTTTTTTATAACCTTGAAAATCATTCTAATG 
TTCAGTTATACTGTACTATTAAAGGGGTTTAAGTTGTAAGCCTCAGAAAGACACAAGGAG 
TGTTTAAGTTCTCTATTTTTGTTGTTTGTTTTTGCTTGTAAGTTTTTGAGACAGGATCTC 
ACCATGTAACTTTGGCTGGCCTGGAACTCAACTATGTAGACCAGGTAGACCTTAAACTGA 
GAGATCTGCCTGCGCTTGCCTCCCAAGCATTAGGACTGATGGTGTGTGTCACCATGCGCA 
GTTCTTGCTGGTTTTGTGTGTAGGTTTCTTTCGCACTGACTTGGTAGATGTGACATGTGA 
CAGATGTATGGAGTCTATAGAAGTGGCCAGACAAAAATGGCCAGAATATTTATTTATTTT
CTTAAAAATTTTCCAAATTAAAGCTAGTTAGTTAACAGTTAAACTGGGCAGGAGTATATG 
AGATAAAGTTGGTTTTCTATTTGTTTTTGT 

SEQ ID NO: 38_AA785735_H

GGCACGAGGCGCGCCTGGGTGGGCCCTGCGGAGGANGGGAAGGAGCGAAGGAGCGAAGGA 
GGAAGCGGAGCGCAGTTCGCCCAAGGGAAGCCGCGCTGCCAACCCTCCCGCCCGCCCGCG 
GTCCTGTCCGCCGTGTCTAGCAGCGGGGCCCAGCATGGTCATGGCGGATGGCCCGAGGCA 
GTTGCAGCGCGGGCCGGTGCGGGTGGGGTTCTACGACATCGAGGGCACGCTGGGCAAGGG 
CAACTTCGGTGTGGTGAAGGTGGGGCGGCACCGGATCACCAAGACGGAGGTGGGAATAAA 
AATAATCGATAAGTCTCAGCTGGATGCAGTGAACCTTGAGAAAATCTACCGAGAAGTACA 
AATAATGAAAATGTTAGACCACCCTCACATAATCAAACTTTATCAGGTAATGGAGACCAA 
AAGTATGTTGTACCTTGTGACAGAATATGCCAAAAATGGAGAAATTTTTGACTATCTTGC 
TAATCATGGCCGGTTAAATGAGTCTGAAGCCAGGCGAAAATTCTGGCAAATCCTGTCTGC 
TGTTGATTATTGTCATGGTCGGAAGATTGTGCACCGTGACCTCAAAGCTGAAAATCTCCT 
GCTGGATAACAACATGAATATCAAAATAGCAGATTTCGGTTTTGGAAATTTCTTTAAAAG 
TGGTGAACTGCTGGCAACATGGTGTGGCAGCCCCCCTTATGCAGCCCCAGAAGTCTTTGA 
AGGGCAGCAGTATGAAGGACCACAGCTGGACATCTGGAGTATGGGAGTTGTTCTTTATGT 
CGTTGTCTGTGGAGCTCTGCCCTTTGATGGACCGACTCTTCCAATTTTGAGGCAGAGGGT 
TCTGGAAGGAAGATTCCGGATTCCGTATTTCATGTCAGAAGATTGCGAGCACCTTATCCG 
AAGGATGTTGGTCCTAGACCCATCCAAACGGCTAACCATAGCGCAAATCAAGGAGCATAA

ATGGATGCTCATAGAAGTTCCTGTCCAGAGACCTGTTCTCTATCCACAAGAGCAAGAAAA 
TGAGCCATCCATCGGGGAGTTTAATGAGCAGGTTCTGCGACTGATGCACAGCCTTGGAAT 
AGATCAGCAGAAARCCATTGAGTCTTTGCAGAACAAGAGCTATAACCACTTTGCTGCCAT 
TTATTTCTTGTTGGTGGAGCGCCTGAAATCACATCGGAGCAGTTTCCCAGTGGAGCAGAG 
ACTTGATGGCCGCCAGCGTCGGCCTAGCACCATTGCTGAGCAAACAGTTGCCAAGGCACA 
GACTGTGGGGCTCCCAGTGACCATGCATTCACCGAACATGAGGCTGCTGCGATCTGCCCT 
CCTCCCCCAGGCATCCAACGTGGAGGCCTTTTCATTTCCAGCATCTGGCTGTCAGGCGGA 
AGCTGCATTCATGGAAGAAGAGTGTGTGGACACTCCAAAGGTCAATGGCTGTCTGCTTGA 
GGCTGTGGCTCCTGTGCTGGTGCGGAAGGGATGCCAGTCACTGCCCAGCAACATGATGGA 
GACCTCCATTGACGAAGGGCTGGAGACAGAAGGAGAGGCCGAGGAAGACCCCGGTCATGG 
GTTTGAGGCATTTCAGTCCACACGCAGCGGGCAGAGACGGCACACTCTGTCAGAAGTGAG 
CAATCAACTGGTCGTGATGCCTGGGGCAGGGAAAATTTTCTCCATGAATGACAGCCCCTC 
CCTTGACAGTGTGGACTGTGAGTATGATATGGGGTCTGTTCAGAGGGACCTGAACTTTCT 
GGAAGACAACCGTTCCCTTAAGGACATCATGTTAGCCAATCAGCCTTCACCCCGCATGAC 
ATCTCCCTTCATAAGCCTGAGACCTACCAACCCAGCCATGCAGGCTCTGAGCTCCCAGAA 
ACGAGAGGTCCACAACAGGTCTCCAGTGAGCTTCAGAGAGGGCCGCAGAGCATCAGATAC 
CTCCCTCACCCAGGGAATTGTAGCATTTAGACAACATCTTCAGAATCTGGCTAGAACCAA 
AGGAATTCTAGAGTTGAACAAAGTGCAGTTGTTGTATGAACAAATAGGACCGGAGGCAGA 
CCCTAACCTGGCGCCGGCGGCTCCTCAGCTCCAGGACCTTGCTAGCAGCTGCCCTCAGGA 
AGAAGTTTCTCAGCAGCAGGAAAGCGTCTCCACTCTCCCTGCCAGCGTGCATCCCCAGCT 
GTCCCCACGGCAGAGCCTGGAGACCCAGTACCTGCAGCACAGACTCCAGAAGCCCAGCCT 
TCTGTCAAAGGCCCAGAACACCTGTCAGCTTTATTGCAAAGAACCACCGCGGAGCCTTGA 
GCAGCAGCTGCAGGAACATAGGCTCCAGCAGAAGCGACTCTTTCTTCAGAAGCAGTCTCA

ACTGCAGGCGTATTTTAATCAGATGCAGATAGCAGAGAGCTCCTACCCACAGCGAAGTCA 

GCAGGTGCCGCTTCGGGGCCAGGAGACTCCACCGCCTTGTCAGCAGGCCCCAGCGTTCAG 

CCTGACCCAGCCCCTGAGCCCCGTCCTGGAGCCTTCCTCCGAGCAGATGCAATACAGCCC 

TTTCCTCAGCCAGTACCAAGAGATGCAGCTTCAGCCCCTGCCCTCCACTTCCGGTCCCCG 

GGCTGCTCCTCCTCTGCCCACGCAGCTACAGCAGCAGCAGCCGCCACCGCCACCACCCCC 

TCCACCACCACGACAGCGAGGAGCTGGCGCAGCCCCGTTACAGTTCTGGTATCAGAGTTG 

TGAGCTGCCAAGCGCTGCTTCCCCTGCGGCAGACTATCCCACTCCCTGTCAGTATCCTGT 

GGATGGAGCCCAGCAGAGCGACCTAACGGGGCCAGACTGTCCCAGAAGCCCAGGACTGCA 

AGAGGCCCCCTCCAGCTACGACCCACTAGCCCTCTCTGAGCTACCTGGAGTCTTTGATTG 

TGAAATGCTAGACGCTGTGGATCCACAACACAACGGGTATGTCCTGGTGAATTAGTGTCA 

GCACAGGAATTGAGGTGGGTCAGGTGAAGGAAGAGTGTATGTTCCTATTTTTATTCCAGG

CTTTTAAATTTAAAGCTTATTTTCTTGCCCTCTCCCTAACGGGGAGAAATCGAGCCACCC 

AACTGGAATCAGAGGGTCTGGCTGGGGTGGATGTTGCTTCCTCCTGGTTCTGCCCCACGA 

CAAAGTTTTCTGTGGCAAGTGCTGGAACATAGTTGTAGGCTGAGGCAGGAGAATGGCGTG 

AACCCGGGAGGCGGAGCTTGCAGTGAGCCAAGATCGTGCCACTGCACTCCAGCCTGGGCG 

ACTGAGCAAGACTCCACCTCAAAAAAAAAAAAAAAGGACAAGAGCAGTATCATCTGGCTC 

TGTTTCTAAACTGGACAAAGAGATTTTCTTAAAGTTTGTATGATCTCCCTTCTGACAGGT 

TCTACAGTGTGGTCTGAAGGACCTGTAATGTCAGAGCGCTTGTGTGGCCGTTGGTGGGAG 

GTGAACGAAAGCAGTGGAGCCTCTCACCTTCCAGTAGCCTCTCACATTCTTATTTTACCA 

TTTTTGTCCTAATTAAGGTAGCCTAGCTGATTCTAGAAGACAGCCATCCTACGTGCACGC 

CCACCTTGTGTCCACATCTTCTCCAGGCAGGTTTCAACCTATCAGCAGACTCAGGCACAC 

ACTGGGGCACAGATAGAGAACCAGGCGGCAGCAGTGCTCGCAGACCCACCCAGGGAGAGC 

TGTGATGGGTTCTGCCCAGATACTCTGCTCGCCCACCCACAAGGGAGCAATAGCTTATAT 

TTGTACATTAGTTTTACCAAGCACTTTCTCTTCTAACCCTCACAACAATTCTATGAAATT 

AGCTGGGGAGATACTGTCCTTATTTTTCACAGCTGAAGAAACCAAAGCTTTGGGAAGTTT 

GTGACTTCTCTGAGATCACAGCTGGTGATAGAAGGAGCTGGGACACGGGCTTGGGTTGAG 

TGGCTTCTGGTTTTGGTTCTCTGGCTTCTAGTGCTGGAAGAAGCCCTCTCTTTCCCTTCT 

CTTTCCTCAGTAGCATCTGACTCTTTTCATAAGCAAACAGCTGTATAAACAAAGCCCCCA 

TTTTGGTCAAGCACAGGGTGAATGTGATATTGTTCCCACAACCTTATTCTCCACTCAACA 

GCCGCCTGGCTTTGGGGAAGAGGCCGCCTTCAGGTGACAGTGCAGCTGTCCAGGTGGCCG 

TGCACTGAACCAGGCTGAGGGAGACAAAAACCCCGCAGACCCGCCTGCCTTTCAGGGTCC 

AGTTAACTGCAGAAGTTTAGGCTCACCTCAAAGATGTCTAGTTTTTCCAAGTTACAATAG 

AGCAGTTTCCTACAGAACACCCCGTTCCTCAATTGCCAAGGGGCCGCATCGCACGGGATC 

AGGCCACCACTGCAGGCCAGCAGATTCCACCCCAGGAACGGTCATGAACTCAGCCTTTGT 

CTCAACGAGGGGCGTAACATTTCCTTACAGTCAAGCCCCATCAACTAGAAGTGCTTATTA 

CTTTTAGGATTAAAAAAGTAATAACAGACTTTGAGTTAATAGTGTGTCTTTTCAGAGGCA 

AAGTGGGTGGGTAGAGGGGAGGTTTAAAAATAGAAGTACAAAAGAACATGGTGGAAACAT 

ATGACCCCAGATGGAATAATGTCACATTCCCCAGTGCAGATAATGGGCTGGTGCTGGGTC 

TGTGGTGTCTGTCTGCAGAAGATTTGCTGAGTGAAGGAAATTCAAGTGGTGAGAGGTTTC 

CACCATGGGTGGTAAGAGAAAGGTGCCTTCACGAAAATCTCTGAAGGGGAAAGAAGTGGA 

GAGAAAGGTTTGCTTCACTTCGGGGACTGCAGTTTGAGAAATAAAAGGGATAGAGAGATA

TCTGCACTTTGTAGAAAGGGCAAGATTATTTGCTTATATCTGAAGGGAGGTGGGTGGTTT 

TGCTGGATGTTTGGTCTGAAAGAGTTACTTTTGATAAAGTTAATCTAATTGTAGTTATAT 

TTTCTGTGTGCTTTTTTTTAATTACTAAGAAAAAAATTGGTGAGTTCAGTAGCTTTGGTA 

TTATGAGTGCAAATCATAATAGCTCCAATGTGAAAAAAAAAATCAAAAGTATAACTTGTC 

ACTTAATGTTAGAAAATTGCCTAAAATGCAGTGTAATAAATAATCTCTGTACCAAATAGT 

AATTTAAATGGGGTAATTTTCTGCAAGGAAAATGTACTGTTTTTATGTTTCCAACCCTCT 

TGA 
SEQ ID NO: 39_AA207220_H

GCTGTGGCTCCCCGTCCTGGTGCGGGACCTGTGCCCCGCGCTTCAGCCCTCCCCGCAAGC 
CTATTGATTCCCCTGCCGCCCTTGCTCCACCTCCTGCTCGCCATGGAGTCGCTGGTTTTC 
GCGCGGCGCTCCGGCCCCACTCCCTCGGCCGCAGAGCTAGCCCGGCCGCTGGCGGAAGGG 
CTGATCAAGTCGCCCAAGCCCCTAATGAAGAAGCAGGCGGTGAAGCGGGACCAGCACAAG 
CACAACCTGCGGCACCGCTACGAGTTCCTGGAGACCCTGGGCAAAGGCACCTACGGGAAG 
GTGAAGAAGGCGCGGGAGAGCTCGGGGCGCCTGGTGGCCATCAAGTCAATCCGGAAGGAC 
AAAATCAAAGATGAGCAAGATCTGATGCACATACGGAGGGAGATTGAGATCATGTCATCA 
GTCAACCACCCTCACATCATTGCCATCCATGAAGTGTTTGAGAACAGCAGCAAGATCGTG 
ATCGTCATGGAGTATGCCAGCCGGGGCGACCTTTATGACTACATCAGCGAGCGGCAGCAG 
CTCAGTGAGCGCGAAGCTAGGCATTTCTTCCGGCAGATCGTCTCTGCCGTGCACTATTGC 
CATCAGAACAGAGTTGTCCACCGAGATCTCAAGCTGGAGAACATCCTCTTGGATGCCAAT 
GGGAATATCAAGATTGCTGACTTCGGCCTCTCCAACCTCTACCATCAAGGCAAGTTCCTG 
CAGACATTCTGTGGGAGCCCCCTCTATGCCTCGCCAGAGATTGTCAATGGGAAGCCCTAC 
ACAGGCCCAGAGGTGGACAGCTGGTCCCTGGGTGTTCTCCTCTACATCCTGGTGCATGGC 
ACCATGCCCTTTGATGGGCATGACCATAAGATCCTAGTGAAACAGATCAGCAACGGGGCC 
TACCGGGAGCCACCTAAACCCTCTGATTGCCTGNNTGGCCTGATCCGGTGGCTGTTGATG 
GTGAACCCCACCCGCCGGGCCACCCTGGAGGATGTGGCCAGTCACTGGTGGGTCAACTGG 
GGCTACGCCACCCGAGTGGGAGAGCAGGAGGCTCCGCATGAGGGTGGGCACCCTGGCAGT 
GACTCTGCCCGCGCCTCCATGGCTGACTGGCTCCGGCGTTCCTCCCGCCCCCTCCTGGAG 
AATGGGGCCAAGGTGTGCAGCTTCTTCAAGCAGCATGCACCTGGTGGGGGAAGCACCACC 
CCTGGCCTGGAGCGCCAGCATTCGCTCAAGAAGTCCCGCAAGGAGAATGACATGGCCCAG 
TCTCTCCACAGTGACACGGCTGATGACACTGCCCATCGCCCTGGCAAGAGCAACCTCAAG 
CTGCCAAAGGGCATTCTCAAGAAGAAGGTGTCAGCCTCTGCAGAAGGGGTACAGGAGGAC 
CCTCCGGAGCTCAGCCCAATCCCTGCGAGCCCAGGGCAGGCTGCCCCCCTGCTCCCCAAG 
AAGGGCATTCTCAAGAAGCCCCGACAGCGCGAGTCTGGCTACTACTCCTCTCCCGAGCCC 
AGTGAATCTGGGGAGCTCTTGGACGCAGGCGACGTGTTTGTGAGTGGGGATCCCAAGGAG 
CAGAAGCCTCCGCAAGCTTCAGGGCTGCTCCTCCATCGCAAAGGCATCCTCAAACTCAAT
GGCAAGTTCTCCCAGACAGCCTTGGAGCTCGCGGCCCCCACCACCTTCGGCTCCCTGGAT 
GAACTCGCCCCACCTCGCCCCCTGGCCCGGGCCAGCCGACCCTCAGGGGCTGTGAGCGAG 
GACAGCATCCTGTCCTCTGAGTCCTTTGACCAGCTGGACTTGCCTGAACGGCTCCCAGAG 
CCCCCACTGCGGGGCTGTGTGTCTGTGGACAACCTCACGGGGCTTGAGGAGCCCCCCTCA 
GAGGGCCCTGGAAGCTGCCTGAGGCGCTGGCGGCAGGATCCTTTGGGGGACAGCTGCTTT 
TCCCTGACAGACTGCCAGGAGGTGACAGCGACCTACCGACAGGCACTGAGGGTCTGCTCA 
AAGCTCACCTGAGTGGAGTAGGCATTGCCCCAGCCCGGTCAGGCTCTCAGATGCAGCTGG 
TTGCACCCCGAGGGGAGATGCCTTCTCCCCCACCTCCCAGGACCTGCATCCCAGCTCAGA 
AGGCTGAGAGGGTTTGCAGTGGAGCCCTGAGCAGGGCTGGATATGGGAAGTAGGCAAATG 
AAATGCGCCAAGGGTTCAGTGTCTGTCTTCAGCCCTGCTGAACGAAGAGGATACTAAAGA 
GAGGGGAACGGGAATGCCCGCGACAGAGTCCACATTGCCTGTTTCTTGTGTACATGGAGG 

GGCCACAGAGA

SEQ ID NO: 40_AA426580_H, MAK_V_H

ATGGGGGCGGCGGCGGGGGACGGGCTCCTGGGGGAGCCGGGGGCGGCTGGGGGCGGCGGC 
GGCGCGGAGGACGCGGCCAGGCCCGCGGCGGCCTGCGAGGGAAGTTTCCTGCCTGCCTGG 
GTGAGCGGCGTGCCCCGCGAGCGGCTCCGCGACTTCCAGCACCACAAGCGCGTGGGCAAC 
TACCTCATCGGCAGCAGGAAGCTGGGCGAGGGCTCCTTTGCCAAGGTGCGCGAGGGGCTG 
CACGTGCTGACCGGGGAGAAGGTGGCCATAAAAGTCATTGATAAGAAGAGAGCCAAAAAG
GACACCTATGTCACCAAA/^ACCTGCGGCGAGAGGGTCAGATCCAGCAGATGATCCGCCAC 
CCCAATATCACTCAGCTCCTTGATATTTTAGAAACGGAAAACAGCTACTACCTGGTCATG 
GAGCTGTGCCCTGGGGGCAACCTGATGCACAAGATCTATGAGAAGAAGCGGCTGGAGGAG 
TCCGAAGCCCGCAGATACATCCGACAGCTCATCTCTGCCGTAGAGCACCTGCACCGGGCC

GGGGTGGTCCACAGAGACTTGAAGATAGAGAATTTGCTACTAGATGAAGACAATAATATC

AAGCTGATTGACTTTGGTTTGAGCAACTGCGCAGGGATCCTGGGTTACTCGGATCCGTTC 

AGCACACAGTGTGGCAGCCCTGCCTACGCTGCACCTGAACTGCTCGCCAGGAAGAAATAC 

GGCCCCAAAATCGATGTCTGGTCCATAGGTGTGAACATGTATGCCATGTTGACCGGGACG 

CTGCCTTTCACGGTGGAGCCTTTCAGCCTGAGGGCTTTGTACCAGAAGATGGTAGACAA/i 

GAAATGAACCCCCTCCCCACTCAGCTCTCCACAGGTGCCATCAGTTTCCTGCGCTCTCTC 

CTGGAACCGGATCCTGTGAAGAGGCCAAATATTCAGCAGGCACTGGCGAATCGCTGGCTT 

AATGAGAATTACACGGGCAAAGTGCCCTGTAATGTCACCTATCCCAACAGGATTTCTCTG 

GAAGATCTGAGCCCGAGCGTCGTGCTGCACATGACCGAGAAGCTGGGTTACAAGAACAGC 

GACGTGATCAACACTGTGCTCTCCAACCGCGCCTGCCACATCCTGGCCATCTACTTCCTC 

TTAAACAAGAAACTGGAGCGCTATTTGTCAGGGAAATCTGACATCCAGGACAGCCTCTGC 

TACAAGACCCGGCTCTACCAGATAGAAAAGTACAGGGCCCCCAAGGAGTCCTATGAGGCC 

TCTCTGGACACCTGGACACGAGATCTTGAATTCCATGCCGTGCAGGATAAAAAGCCCAT^A 

GAACAAGAAAAAAGAGGGGATTTTCTTCATCGACCATTCTCCAAGAAGTTGGACAAGAAC 

CTGCCCTCGCACAAACAGCCCTCAGGCTCGCTTATGACACAGATTCAGAACACCAAAGCC 

CTCCTGAAGGACCGGAAGGCCTCCAAGTCCAGCTTCCCCGACAAAGATTCCTTTGGCTGC 

CGCAATATTTTCCGCAAAACCTCAGATTCCAATTGTGTGGCTTCTTCTTCCATGGAGTTC 

ATCCCCGTGCCACCGCCCAGGACCCCGAGGATTGTGAAGAAACCGGAGCCCCATCAGCCA 

GGGCCCGGAAGCACTGGCATCCCCCACAAGGAAGACCCCCTGATGCTGGACATGGTGCGC 

TCCTTCGAGTCTGTGGATCGCGACGACCACGTAGAAGTGCTGTCTCCCTCTCATCACTAC 

AGGATTCTGAACTCCCCGGTCAGCTTGGCTCGCAGAAATTCCAGCGAGAGGACGCTGTCC 

CCGGGTCTGCCATCCGGAAGCATGTCGCCTCTCCATACTCCTTTGCATCCAACTCTGGTC 

TCTTTTGCTCACGAAGATAAGAACAGCCCCCCAAAAGAGGAGGGCCTGTGTTGCCCACCT 

CCGGTTCCCAGCAATGGCCCCATGCAGCCTCTGGGGAGCCCCAATTGTGTGAAAAGCCGA 

GGCCGGTTCCCTATGATGGGCATCGGACAGATGTTAAGGAAGCGCCATCAGAGTCTGCAG

CCATCTGCAGATAGGCCCCTGGAGGCCAGCCTGCCCCCACTGCAGCCCCTAGCCCCTGTG 
AACCTTGCCTTTGACATGGCCGATGGGGTCAAGACCCAGTGCTAA 

SEQ ID NO: 41_Z36720_H 

ATGGACACAAAGCTGAACATGCTGAACGAGAAGGTGGACCAGCTCCTGCACTTCCAAGAA 

GATGTCACAGAGAAGTTGCAGAGCATGTGCCGAGACATGGGCCACCTGGAGCGGGGCCTG 

CACAGGCTGGAGGCCTCCCGGGCACCGGGCCCGGGCGGGGCTGATGGGGTTCCCCACATT 

GACACCCAGGCTGGGTGGCCCGAGGTCCTGGAGCTGGTGAGGGCCATGCAGCAGGATGCG 

GCCCAGCACGGTGCCAGGCTGGAGGCCCTCTTCAGGATGGTGGCTGCGGTGGACAGGGCC 

ATCGCTTTGGTGGGGGCCACGTTCCAGAAATCAAAGGTGGCGGATTTCCTCATGCAGGGG 

CGTGTGCCCTGGAGGAGAGGCAGCCCAGGTGACAGCCCTGAGGAGTGGGTAAAAGAGGAG 

GAGGTCTGTTTCATGCCTCCAGTTCCCCCAGCTCCGGGGGCAGCAGGACAGAGCCTGCAG 

AAGGATAAGGGGGAGCTGTCTGCCGAGCAGGGGATCTGGGCCACATTGATGACGCTGGTG 

ATCATGGTGACAGCGGCAAATAAAGAGCGAGTGGAAGAAGAGGGAGGAAAACCAAAGCAT 

GTGCTGAGCACCAGTGGGGTGCAGTCTGATGCCAGGGAGCCTGGGGAAGAGAGCCAGAAG 

GCGGACGTGCTGGAGGGGACAGCGGAGAGGCTGCCCCCCATCAGAGCGTCAGGGCTGGGA 

GCTGACCCCGCCCAGGCAGTGGTCTCACCGGGCCAGGGAGATGGTGTTCCTGGCCCAGCC 

CAGGCATTCCCTGGCCACCTGCCCCTGCCCACAAAGGTGGAAGCCAAGGCTCCTGAGACA 

CCCAGCGAGAACGTCAGGACTGGCCTGGAATTGGCTCCAGCACCCGGCAGGGTCAATGTG 

GTCTCCCCGAGCCTGGAGGTTGCACCAGGTGCAGGACAAGGAGCATCGTCCAGCAGGCCT 

GACCCTGAGCCCTTAGAGGAAGGCACGAGGCTGACTCCAGGGCCTGGCCCTCAGTGCCCA 

GGGCCTCCAGGGCTGCCAGCCCAGGCCAGGGCAACCCACAGTGGTGGAGAAACACCTCCA 

AGGGCAGCCCTGCTGAAGGGCGCTGTGGCCCCGGGCTTCTCTCGGAGGGACCTGGTGTTT 

CCTAGCATCTTCTGCGCCTGCCTAGGGATCTCCATCCACATACAAGAGATGGATACTCCT 
GGGGAGATGGTGATGA3AGGGAGGGGGAGGGTTGGAGGGAGCGTGAGCAGAGAGGGTGGA
GGAGGTGCGCAGGGAGGGAAGGAG^^GGGAGCTGGGAGGGGGrGGTGGGTGCAAGGGGGT 
GGGACTGAGGC2GGAGAAGA;3ACGCCTGAAGGAGCGAGAGAGGTCTCGGGGCTGCAGGAG
AGCAGCAGCCCCGGGGGAGTGAAG3CAGAGGAGGAGCAAAGGGCTGGGGCCGAGCCTGGC 
ACGAGACCAAGCTTGGCCAGGAGTGACGACAATGACCACGAGGTTGGGGCCCTGGGCCTG 
CAGCAGGGCAAAAGCCCAGGGGCGGGAAACCCTGAGCCTGAGCAGGACTGTGCAGCCAGG 
GGTCCGGTGAGAGCTGAAGCAGTAAGGAGGATGCCCCCAGGCGCCGAGGCTGGCAGCGTG 
GTTCTGGATGACAGTCCGGCCCCACCAGCTCCTTTTGAACACCGGGTAGTGAGCGTCAAG 
GAGACCTCCATCTCTGGGGGTTACGAGGTGTGCCAGCACGAAGTCTTGGGAGGGGGTCGG 
TTTGGCCAGGTCCACAGGTGCACAGAGAAGTCCACAGGCCTCCCACTGGCTGCCAAGATC 
ATCAAAGTGAAGAGCGCCAAGGACCGGGAGGACGTGAAGAACGAGATCAACATCATGAAC 
CAGCTCAGCCACGTGAACCTGATCCAGCTCTATGACGCCTTCGAGAGCAAGCACAGCTGC
ACCCTTGTCATGGAGTACGTGGACGGGGGTGAGCTCTTCGACCGGATCACAGATGAGAAG 
TACCACCTGACTGAGCTGGATGTGGTCCTGTTCACCAGGCAGATCTGTGAGGGTGTGCAT 
TACCTGCACCAGCACTACATCCTGCACCTGGACCTCAAGCCGGAGAACATATTGTGCGTC 
AATCAGACAGGACATCAAATTAAGATCATTGACTTTGGGCTGGCCAGAAGGTACAAGCCT 
CGAGAGAAGCTGAAGGTGAACTTCGGCACTCCTGAGTTGCTGGCCCCAGAAGTCGTCAAT 
TATGAGTTTGTCTCATTCCCCACAGACATGTGGAGTGTGGGAGTCATCACCTACATGCTA 
CTCAGTGGCTTGTCCCCATTTCTAGGGGAAACAGATGCAGAGACCATGAATTTCATTGTA 
AACTGTAGCTGGGATTTTGATGCTGACACCTTTGAAGGGCTCTCGGAGGAGGCCAAGGAC 
TTTGTTTCCCGGTTGCTGGTCAAAGAGAAGAGCTGCAGAATGAGTGCCACACAGTGCCTG 
AAACAGGAGTGGGTGAATAATTTGCCTGCCAAAGCTTCAAGATCCAAAACTCGTCTCAAA 
TCCCAACTACTGCTGCAGAAATACATAGCTCAAAGAAAATGGAAGAAACATTTCTATGTG 
GTGACTGCTGCCAACAGGTTAAGGAAATTTCCAACTTCTCCCTAA 

SEQ ID NO: 42_SGK088_H 

GGGGAGATGGCGCTGTTTGAGTGCCTGGTGGCGGGGCCCACTGACGTGGAGGTGGATTGG 
*-€TGTGCCGTGGCCGCCTGCTGCAGCCTGCACTGCTCAAATGCAAGATGCATTTCGATGGC 
CGCAAATGCAAGCTGCTACTTACATCTGTACATGAGGACGACAGTGGCGTCTACACCTGC 
AAGCTCAGCACGGCCAAAGATGAGCTGACCTGCAGTGCCCGGCTGACCGTGCGGCCCTCG 
TTGGCACCCCTGTTCACACGGCTGCTGGAAGATGTGGAGGTGTTGGAGGGCCGAGCTGCC 
CGTTTCGACTGCAAGATCAGTGGCACCCCGCCCCCTGTTGTTACCTC5GACTCATTTTGGC 
TGCCCCATGGAGGAGAGTGAGAACTTGCGGCTGCGGCAGGACGGGGGTCTGCACTCACTG 
CACATTGCCCATGTGGGCAGCGAGGACGAGGGGCTCTATGCGGTCAGTGCTGTTAACACC 
CATGGCCAGGCCCACTGCTCAGCCCAGCTGTATGTAGAAGAGCCCCGGACAGCCGCCTCA 
GGCCCCAGCTCGAAGCTGGAGAAGATGCCATCCATTCCCGAGGAGCCAGAGCAGGGTGAG 
CTGGAGCGGCTGTCCATTCCCGACTTCCTGCGGCCACTGCAGGACCTGGAGGTGGGACTG 
GCCAAGGAGGCCATGCTAGAGTGCCAGGTGACCGGCCTGCCCTACCCCACCATCAGCTGG 
TTCCACAATGGCCACCGCATCCAGAGCAGCGACGACCGGCGCATGACACAGTACAGGGAT 
GTCCATCGCTTGGTGTTCCCTGCCGTGGGGCCTCAGCACGCCGGTGTCTACAAGAGCGTC 
ATTGCCAACAAGCTGGGCAAAGCTGCCTGCTATGCCCACCTGTATGTCACAGATGTGGTC 
CCAGGCCCTCCAGATGGCGCCCCGCAGGTGGTGGCTGTGACGGGGAGGATGGTCACACTC 
ACATGGAACCCCCCCAGGAGTCTGGACATGGCCATCGACCCGGACTCCCTGACGTACACA 
GTGCAGCACCAGGTGCTGGGCTCGGACCAGTGGACGGCACTGGTCACAGGCCTGCGGGAG 
CCAGGGTGGGCAGCCACAGGGCTGCGTAAGGGGGTCCAGCACATCTTCCGGGTCCTGAGC 
ACCACTGTCAAGAGCAGCAGCAAGCCCTCACCCCCTTCTGAGCCTGTGCAGCTGCTGGAG 
CACGGCCCAACCCTGGAGGAGGCCCCTGCCATGCTGGACAAACCAGACATCGTGTATGTG
GTGGAGGGACAGCCTGCCAGCGTCACCGTCACATTCAACCATGTGGAGGCCCAGGTCGTC 
TGGAGGAGCTGGCGAGGGGCCGTCGTAGAGGCACGGGCCGGTGTGTACGAGCTGAGCCAG 
CCAGATGATGACCAGTACTGTCTTCGGATCTGCCGGGTGAGCCGCCGGGACATGGGGGCG 
CTCACCTGCACCGCCCGAAACCGTCACGGCACACAGACCTGCTCGGTCACATTGGAGCTG

GCAGAGGCCCCTCGGTTTGAGTCCATCATGGAGGACGTGGAGGTGGGGGCTGGGGAAACT 

GCTCGCTTTGCGGTGGTGGTCGAGGGAAAACCACTGCCGGACATCATGTGGTACAAGGAC 

GAGGTGCTGCTGACCGAGAGCAGCCATGTGAGCTTCGTGTACGAGGAGAATGAGTGCTCC 

CTGGTGGTGCTCAGCACGGGGGCCCAGGATGGAGGCGTCTACACCTGCACCGCCCAGAAC 

CTGGCGGGTGAGGTCTCCTGCAAAGCAGAGTTGGCTGTGCATTCAGCTCAGACAGCTATG 

GAGGTCGAGGGGGTCGGGGAGGATGAGGACGATCGAGGAAGGAGACTCAGCGACTTTTAT 

GACATCCACCAGGAGATCGGCAGGGGTGCTTTCTCCTACTTGCGGCGCATAGTGGAGCGT 

AGCTCCGGCCTGGAGTTTGCGGCCAAGTTCATGCCCAGGCAGGCGAAGGCAAAGGGATGA 

GCGCGTCGGGAGGCCCGGCTGCTGGCCAGGCTCCAGCACGACTGTGTCCTCTACTTCCAT 

GAGGCCTTCGAGAGGCGCCGGGGACTGGTCATTGTCACGGAGGTGTGCACAGAGGAGGTG 

CTGGAGCGAATCGCCAGGAAAGCCACCGTGTGTGAGTCTGAGATCCGGGGGTATATGGGG 

CAGGTGCTAGAGGGAATACACTACGTGCACCAGAGCCACGTGCTGCACGTCGATGTGAAG 

CCTGAGAACCTGCTGGTGTGGGATGGTGCTGCGGGCGAGCAGGAGGTGCGGATCTGTGAG 

TTTGGGAATGCCCAGGAGCTGACTCCAGGAGAGCCCCAGTACTGCCAGTATGGCACACCT 

GAGTTTGTAGCACCCGAGATTGTCAATCAGAGGCCGGTGTCTGGAGTCACTGACATGTGG 

CCTGTGGGTGTTGTTGCCTTGGTCTGTCTGACAGGAATCTCCGCGTTTGTTGGGGAAAAT 

GACCGGACAACATTGATGAACATCCGAAACTACAACGTGGCCTTGGAGGAGAGCACATTC 

GTGAGCCTGAGCAGGGAGGCCCGGGGCTTCCTCATCAAAGTGTTGGTGCAGGACCGGCTG 

AGACCTACCGCAGAAGAGACCCTAGAAGATCGTTGGTTGAAAACTCAGGCAAAGGGCGCA 

GAGGTGAGCACGGATCACCTGAAGCTATTCGTCTCGCGGCGGAGGTGGCAGCGCTCGCAG 

ATCAGCTACAAATGCCACCTGGTGCTGCGCCCCATCCCCGAGCTGCTGGGGGCCCCCCCA 

GAGCGGGTGTGGGTGACCATGCCCAGAAGGCCACCCCGCAGTGGGGGGCTCTCATCCTCC 

TCGGATTCTGAAGAGGAAGAGCTGGAAGAGCTGCCCTCAGTGCGCCGCCCACTGCAGCCC 

GAGTTCTCTGGCTCCCGGGTGTCCCTCACAGACATTCCCACTGAGGATGAGGCCCTGGGG 

ACCCCAGAGACTGGGGCTGGGACCGCCATGGACTGGCAGGAGCAGGGAAGGGCTCCCTCT 

CAGGACCAGGAGGCTCCCAGCGCAGAGGCCCTCCCCTCGCCAGGCCAGGAGCCCGCAGGT 

GGGGCTAGCCCCAGGCGGGGAGAGGTCCGCAGGGGCAGCTCGGCTGAGAGCGCCCTGCCC 

CGGGCCGGGCGGCGGGAGCTGGGCCGGGGCCTGCACAAGGGGGCGTGTGTGGAGGTGCCG 

CAGCGCCGGAGCCCCGGCCCGGGAGCCACCCGGCTGGCCGGGGGAGGCGTGGGTGAGGGC 

GAGTATGCCCAGAGGGTGCAGGGCCTGGGGCAGCGGGTGGTGGGGGGAGGGGGCGAGGAT 

GGCAAGGTCAGCGGCCTCAGGGGTCCCCTGCTGGAGAGCCTGGGGGGCCGTGCTCGGGAC 

CCCCGGATGGCACGAGCTGCCTCCAGCGAGGCAGCGCCCCACCACCAGCCCCCAGTGGAG 

AACCGGGGCCTGCAAAAGAGCAGCAGGTTCTCCGAGGGTGAGGCGGAGCCCCGGGGGCGG 

CACCGCCGAGCGGGGGCGCGCCTGGAGATCCCCGTGGCCAGGGTTGGGGCCCGTAGGCTA 

CAGGAGTCTCCTTCCCTGTGTGCCCTCAGCGAGGGCCAGGCATCCAGCCCTGCAGGGCCC 

AGCGCCCCCAAACCCAGTACCCCTAAGTCTGCAGAAGCTTGTGCCACCACACCTAGTGAT 

GCTCGGCAGCCCCCCGCACCCCAGCGTGCCCAAGAGAAGGCTCCAGAGCCCAGGCCAGAA 

CCAGTCCGAGCCTCCAAGCCTGCACCACCCCCCCAGGCCCTGCAAACCCTAGCGCTGCCC 

CTCACACCCTATGCTCAGATCATTCAGTCCCTCCAGCTGTCAGGCCACGCCCAGGGCCCC 

TCGCAGGGCCCTGCCGCGCCGCCTTCAGAGCCCAAGCCCCACGCTGCTGTCTTTGCCAGG 

GTGGCCTCCCCACCTCCGGGAGCCCCCGAGAAGCGCGTGCCCTCAGCCGGGGGTCCCCCG 

GTGCTAGCCGAGAAAGCCGGAGTTCGCAGGGTGCCCCCCAGGCCAGGCAGGAGTCTGAGT 

AGCAGCATCGAAAACTTGGAGTGGGAGGCGGTGTTCGAGGCCAAGTTCAAGCGCAGCCGC 

GAGTCGCCCCTGTCGCTGGGGCTGCGGCTGCTGAGCCGTTCGCGCTCGGAGGAGCGCGGC 

CCCTTCCGTGGGGCCGAGGAGGAGGATGGCATATACCGGCGCAGCCGGGCGGGGAGGGCG 

CTGGAGGTGGTGCGACGGCGTGAGCGGTCACGCTGGGTGCAGGAGCTCAGGGGTGTCGGA 

GAGCCTGGCCTCGTCCGCCGCCTCTCGCTGTCACTGTCCCAGCGGCTGCGGCGGACCCCT 

CCGGGGGAGCGGGAGGGGGGCTGGGAGGCGCGCGGCGGGGACGGAGAGAGGTGGGAGGGC 

GGGAGGTGGGGGGGGGGGTCGGCGGTGCTGGGGATGGGCAGGGGGCTGAGCTTGACCGTG 
GAGGGGGTGTCCAGGOGATTGGAGGGCAGTGGGAGGAGGGAGGAGTCGGGGGGCGCGTCG
GGGGGGAGGAGGCCGCTGTTGGGAGGGGTTCGCAGGGGCACGTCGGAGGGCGAGAGTCl'G 
GGGCGGGTTGGCGTTGGGGACAACCAGTTGGCGGC rGAGGGCGGCGCCACGACG GGTTGG 

GCCGAGTCCCTGGGCTCCGAGGCCAGCGCCACGTCGGGCTCCTCAGCCCCAGGGGAAAGC 
CGAAGCCGGCTCCGCTGGGGCTTCTCTCGGCCGCGGAAGGACAAGGGGTTATCGCCACCA 
AACCTCTCTGCCAGCGTCCAGGAGGAGTTGGGTCACCAGTACGTGCGCAGTGAGTCAGAC
TTCCCCCCAGTCTTCCACATCAAACTCAAGGACCAGGTGCTGCTGGAGGGGGAGGCAGGG 
AGCCTGCTCTGCCTGCCAGCGGCCTGGCCTGCACCGCACATCTCCTGGATGAAAGACAAG 
AAGTCCTTGAGGTCAGAGCCCTCAGTGATCATCGTGTCCTGCAAAGATGGGCGGCAGCTG 
CTCAGCATCCCCCGGGCGGGCAAGCGGCACGCCGGTCTCTATGAGTGCTCGGCCACCAAC 
GTACTGGGCAGCATCACCAGCTCCTGTACCGTGGCTGTGGCCCGAGTCCCAGGAAAGCTA 
GCTCCTCCAGAGGTAACCCAGACCTACCAGGACACGGCGCTGGTGCTGTGGAAGCCGGGA 
GACAGCCGGGCACCTTGCACGTATACGCTGGAGCGGCGAGTGGATGGGGAGTCTGTGTGG 
CACCCTGTGAGCTCAGGCATCCCCGACTGTTACTACAACGTGACCCACCTGCCAGTTGGC 
GTGACTGTGAGGTTCCGTGTGGCCTGTGCCAACCGTGCTGGGCAGGGGCCCTTCAGCAAC 
TCTTCTGAGAAGGTCTTTGTCAGGGGTACTCAAGATTCTTCAGCTGTGCCATCTGCTGCC 
CACCAAGAGGCCCCTGTCACCTCAAGGCCAGCCAGGGCCCGGCCTCCTGACTCTCCTACC 
TCACTGGCCCCACCCCTAGCTCCTGCTGCCCCCACACCCCCGTCAGTCACTGTCAGCCCC 
TCATCTCCCCCCACACCTCCTAGCCAGGCCTTGTCCTCGCTCAAGGCTGTGGGTCCACCA 
CCCCAAACCCCTCCACGAAGACACAGGGGCCTGCAGGCTGCCCGGCCAGCGGAGCCCACC 
CTACCCAGTACCCACGTCACCCCAAGTGAGCCCAAGCCTTTCGTCCTTGACACTGGGACC 
CCGATCCCAGCCTCCACTCCTCAAGGGGTTAAACCAGTGTCTTCCTCTACTCCTGTGTAT 
GTGGTGACTTCCTTTGTGTCTGCACCACCAGCCCCTGAGCCCCCAGCCCCTGAGCCCCCT 
CCTGAGCCTACCAAGGTGACTGTGCAGAGCCTCAGCCCGGCCAAGGAGGTGGTCAGCTCC 
CCTGGGAGCAGTCCCCGAAGCTCTCCCAGGCCTGAGGGTACCACTCTTCGACAGGGTCCC 
CCTCAGAAACCCTACACCTTCCTGGAGGAGAAAGCCAGGGGCCGCTTTGGTGTTGTGCGA 
GCGTGCCGGGAGAATGCCACGGGGCGAACGTTCGTGGCCAAGATCGTGCCCTATGCTGCC
GAGGGCAAGCCGCGGGTCCTGCAGGAGTACGAGGTGCTGCGGACCCTGCACCACGAGCGG
ATCATGTCCCTGCACGAGGCCTACATCACCCCTCGGTACCTCGTGCTCATTGCTGAGAGC 
TGTGGCAACCGGGAACTCCTCTGTGGGCTCAGTGACAGGTTCCGGTATTCTGAGGATGAC 
GTGGCCACTTACATGGTGCAGCTGCTACAAGGCCTGGACTACCTCCACGGCCACCACGTG 
CTCCACCTAGACATCAAGCCAGACAACCTGCTGCTGGCCCCTGACAATGCCCTCAAGATT 
GTGGACTTTGGCAGTGCCCAGCCCTACAACCCCCAGGCCCTTAGGCCCCTTGGCCACCGC 
ACGGGCACGCTGGAGTTCATGGCTCCGGAGATGGTGAAGGGAGAACCCATCGGCTCTGCC 
ACGGACATCTGGGGAGCGGGTGTGCTCACTTACATTATGCTCAGTGGACGCTCCCCGTTC 
TATGAGCCAGACCCCCAGGAAACGGAGGCTCGGATTGTGGGGGGCCGCTTTGATGCCTTC 
CAGCTGTACCCCAATACATCCCAGAGCGCCACCCTCTTCTTGCGAAAGGTTCTCTCTGTA 
CATCCCTGGAGCCGGCCCTCCGTGCAGGACTGCCTGGCCCACCCATGGTTGCAGGACGCC 
TACCTGATGAAGCTGCGCCGGCAGACGCTCACCTTCACCACCAACCGGCTCAAGGAGTTC 
CTGGGCGAGCAGCGGCGGCGCCGGGCTGAGGCTGCCACCCGCCACAAGGTGCTGCTGCGC 
TCCTACCCTGGCGGCCCCTAGAGGCACGGACCACAGCCAGGCCTCGGGCTTCAACTGGGG 
TTCCCACCAATGCCACGGGACATTCCAGGGCCCACGCTGAGCCAGGCGGGCCTGGGGCTT 
CGGTTACCACCAGCAGCAACATCTGGCTGGGCTCTTACCTCATAGACCTTCAAGGACAGA 
GACCCCAGGGCCTGGACCTGATGCCACCCCAGGCCAAAGCCAGAGTGGGAGACCCATTGG 
TCAGGCTCAGCAGGGTGGGAACAGGCAGAGGGACAAGAGGGGAATGGAGAAGTGGAGAGG 
AAAAGGAATCGAGGGAGAGGAAGGGGGAGGCTCTAGGAAGGTTCTGGGTTGGGGGTCAGT 
GCATCTCAGGGAGAACCAAGGAAGGTGGGCATGGCTGGAGAGGAGGAAAAGGAAGGAGCC 
CCAGGTGTCAGGGCAGTAGGCTGGGAGTCAGTGTGGCAAAGCGGGGGCAGGACACAGATA 
GAGTGGCAGGGGCCCAGGGCTGGGACATGAGAGAAGGCAGCGAGGCGGCAGAGGGAGAAG 
AGAGGACTCAGGTGGAGGTGGGGTGGGTCAGCTGTCAGCATCCCTCAGAGGAGAAATGTG 
GAGAGCTGGAGGCCAGCAGTCACTCACACTCGCTCTGTCCTCCTGTCCAGTGGATACAGC
CCTGGGCGCTCTGCTGGCCCAAGGATGTCCCCACTGCCCCTCCATGGCCTTTGGCCTTCT 
TCCCATTCATATTTATTTATTTATTGACTTTTATGAAGTTTCCCGTTCCATCCGATCCCT 
ACTGCCCATGTTGTCCTGACCATCCCTCCCAGCCATCCAGCTGTCTGTCTGTCTGCCACA 
AGGAAATAAAAATGGGAAGCAGCAAAAAAA 

SEQ ID NO: 43_AA542015_M SGK088_M 

GCCACGGACATCTGGGGAGCGGGTGTGCTCACCTACATCATGCTTAGTGGGTACTCCCCA 
TTCTATGAGCCAGACCCCGAGGAAACAGAGGCTCGGATTGTTGGGGGTCGCTTTGATGCC 
TTCCAGTTGTATCCTAAGAGATCCCAGAGTGCCACGGTCTTCTTGAGAAAGGTCGTCTGA 
GTACATCCCTGGAGCCGGCCCTCTCTGCAGGACTGCTTGGCCCACCCATGGCTGCAAGAT 
GCCTACCTGATGAAGCTGCGCCGCCAGACAGTCACCTTCACCACGAACCGGCTCAAGGAA 
TTCCTGGGCGAGCAGCGGCGACGTCGGGCTGAGGCTGCTACCCGTCACAAGGTGCTGCTG 
CGCTCCTACCCTGGCAGCCCCTAGGTGGCACAGACCGCAGCCCGGCCACGGGCTTCAACT 
TGGGTTCTCACTCGCGCTGCCAAGGGACATTCCAGAGCCCATGCTGAGCTGGACAGGCAG 
GGGCTTCAGATACCAGCAGCAGCAGCAGCAGCAGCAGCAGCAACATCTGGCTGGGCTATT 
ACCTCATGGACCTAAGAGGACAAGGCCCTGGGGCTTCAGCCGAATGTCACCCCGGCCATA 
ACCAGAGCAGGAGACCCACTGGCCAGGCTGGGCAAGGGTGAGAGCAGAAAGAGGCAAAGA 
GGGAGTTGGGAAGTGAAGAATGAGACGGAGGATAGAGAGGGAGGAGTTTGAGGAAGGTTC 
TAGGCTGGAGTGGAATGCTATATCTCAGGGAGAAGCCAGAAGGGGACATGGCTGAAGAGG 
AAGAAGGACCCTGTGATGTGGGAATGTGGTGGAGAGGAGGACTGGACATAGAGAGTGTGC 
CAGGAGCCAGAGCAGAGACATAAGGGAGGGCAGAAGGGTAGAAGGCAACAGGAGTGGGCT 
CAGGGGTGGCAGGGCAGGCCAGCAGCTGCATCTTCAGAAAGAGAGAGGAGAAAGGCAAAG 
AGACGAAAGGCCGCTCCAGCTGGTCTCCTGTCCCAGCCGATGCAGTTCTGGGCGTTGTCC 
ACTGGCCCAGGGATGTCCTCACTGGTCCTCCATGGGCTTTGCCGTGCTTCCGATTTGTAT 
TTATTTATTTATTGCCTTTTGTGGAGTTTCCTTTCTATCCAGTCCCTAGTGCCTATGTTG 
TCCCGACCATCCCCCTTCAGTCAGGCAGCTGTCTGTGCAGCTGTGTGTGTGTCTGTGACA 
AGGAAAATAAAACAAAACAAAACAAAACAAAAGAAACAAAAAAAAAAGAG

SEQ ID NO: 44_R19772_H 

ATGAAGGGCGGCGACAGGGCTTAGAGCCGAGGTGGGTCTTTGGGGTGGCTCTTTGGTAAG 
TGCTGCTGTTGCTTCCCGTGTAGAGATGGATACTCTCATTCCTCAAGCGAGAATGGAGGC 
AAGTCCGAGTCCGTAGCCAACCTGCAGGCCCAGCCCTCCCTGAACTTCATGCACAGTTGC 
CCGGGTCCCAAGCGCTCCACCAACAGTCTTAAGAAGTGGCTGACGAGTCCTGTGCGTCGG 
CTCAACAGGGGGAAGGCAGATGGAAACATCAAAAAGCAGAAGAAAGTTCGCGATGGTCGG 
AAGAGCTTTGACCTGGGATCTCCCAAGCCTGGGGATGAAACAACCCCTCAGGGAGACAGG 
GCTGATGAGAGCAAGAAAGGTTGGGGTGAAGATGAGGCGGATGAAGAGTCACACACAGCG 
CTCCCAGCAGCTATGAAGATTTTTGACAACGACCCTACACAGGATGAAATGTGCTCGTCT 
TTGGTAGGAGCCCGGCAGGCTTCGACTGAAGTACCTAGTGCTGCAGACGTTGTCAATGGA 
ATAGAAAAGTTGGTCAAAAACAAGGTGAGTCTAGAAGGAAGCTCATACCGGGGGAGCTTG 
AAAGAGGCTGCAGGCTGCGTGAATGAGGGGATGGCCCCACCCACACCTCGTAAAAATCCA 
GAAGAAGAACAGAAAGCCAAGGCCGTGAGAGGCAGGATGTTTGTGCTGAATGAGGTGGTA 
CAGACAGAGAAAGAGTATGTCAAGGATGTGGGCATTGTGGTGGAGGGGTTCATGAAGAGA 
ATAGAAGAAAAGGGTGTCCCTGAGGATATGGGAGGAAAGGAGAAAATGGTGTTTGGAAAT 
ATTCATCAGATTTATGAGTGGGATAAGGATTTTTTGCTGGGGGAAGTGGAAAAGTGTATC 
CAGGAGCAAGACAGATTGGCACAGCTCTTTATTAAGCACGAGCGGAAGCTGCACATCTAC 
GTGTGGTATTGTCAGAATAAGCCGCGCTCAGAGTACATCGTTGCTGAGTATGACGCCTAC 
TTTGAGGAGGTAAAACAGGAGATAAATCAGAGGCTGACACTGAGTGACTTCCTCATCAAG 
CCCATTCAGAGAATAACAAAATACCAGTTGCTCCTCAAGGACTTCCTGAGATACAGTGAG 
AAGGCTGGTTTGGAGTGTTCAGATATCGAGAAAGCAGTGGAGTTAATGTGCCTTGTTCCC 
AAACGCTGCAATGACATGATGAATCTAGGACGTCTGCAGGGCTTTGAGGGCACTCTGACI
GCTCAGGGGAAGCTACTGCAGCAGGACACATTCTATGTGATCGAGCTGGATGCAGGCATG 
CAGTCrCGGACCAAAGAGAGGCGCGTGTTCCTCTTCGAGCAGATTGTCATCTTCAGTGAA 
CTGCTCAGGAAGGGATCCCTCACCCCTGGCTACATGTTCAAAAGGAGCATCAAGATGAAT 
TACTTGGTCCTGGAGGAGAATGTGGACAATGATCCCTGCAAGTTTGCACTCATGAACAGA 
GAGAGTTCTGAGAGGGTTGTTCTGCAAGCCGCCAACGCTGACATCCAGCAGGCCTGGGTG 
CAGGACATCAATCAAGTCTTAGAAACACAGCGAGACTTTTTGAATGCACTGCAATCGCCC 
ATTGAGTATCAACGGAAAGAAAGGAGCACAGCTGTGATGAGGTCTCAACCTGCCAGGCTT 
CCGCAAGCCAGCCCCAGGCCCTACTCCTCTGTTCCTGCGGGCTCAGAGAAGCCCCCAAAG 
GGCTGCAGCTATAACCCACCTCTGGCTCCCCTGAAGATATCTACCTCCAATGGCAGTCCA 
GGGTTTGAATACCACCAGCCTGGGGACAAGTTCGAAGCCAGCAAGAACGACCTGGGAGGC
TGCAATGGGACCTCGTCCATGGCCGTGATCAAAGATTACTATGCACTGAAGGAGAATGAA 
ATCTGTGTGAGCCAAGGTGAGGTGGTCCAGGTCCTCGCCGTCAACCAGCAGAACATGTGT 
CTGGTGTACCAGCCTGCCAGCGACCATTCCCCCGCCGCCGAGGGCTGGGTCCCAGGCAGC 
ATCCTGGCGCCCCTCACCAAAGCCACAGCAGCAGAAAGTAGTGACGGGAGCATCAAGAAG 
TCATGTTCATGGCATACTCTACGCATGAGAAAGCGGGCGGAAGTGGAGAACACGGGTAAA 
AATGAAGCCACAGGGCCTCGTAAACCCAAGGATATTCTGGGCAACAAAGTCTCTGTTAAA 
GAGACGAACAGTTCCGAGGAATCAGAGTGTGATGATCTTGACCCTAATACTAGCATGGAG 
ATCTTAAATCCAAATTTCATCCAAGAAGTGGCCCCAGAATTCCTTGTGCCCTTGGTGGAT 
GTGACCTGCTTGCTTGGGGACACAGTGATACTGCAGTGCAAAGTCTGTGGGCGGCCAAAG 
CCCACCATCACTTGGAAGGGTCCAGACCAGAACATCCTTGACACTGATAACAGCTCAGCC 
ACATACACGGTCTCCTCTTGTGATTCTGGAGAAATCACCCTGAAGATCTGTAATCTGATG 
CCCCAAGACAGTGGGATTTATACCTGCATAGCAACAAATGACCACGGGACCACATCAACG 
TCTGCAACAGTCAAAGTGCAAGGTGTTCCAGCAGCCCCTAACCGCCCCATTGCCCAGGAG 
AGAAGCTGCACCTCCGTGATTCTCCGCTGGCTGCCCCCCTCCAGCACAGGAAACTGCACT 
ATTTCTGGTTACACTGTGGAGTACAGAGAGGAAGGTTCTCAGATCTGGCAGCAGTCAGTG 
GCTTCGACCTTGGACACTTACCTCGTCATCGAAGACCTTAGTCCCGGGTGTCCTTATCAG 
TTCAGAGTCAGTGCCAGTAACCCCTGGGGAATCAGCCTTCCCAGCGAGCCCTCGGAGTTT
GTGCGACTTCCAGAATACGATGCTGCTGCTGATGGTGCCACCATTTCTTGGAAGGAAAAT 
TTTGACTCAGCTTACACTGAGCTGAATGAAATTGGAAGAGGCCGTTTCTCTATAGTAAAG
AAATGCATTCACAAAGCTACCCGCAAAGATGTGGCTGTGAAATTTGTTAACAAAAAAATG 
AAGAAGAAAGAACAGGCTGCCCACGAGGCTGCCCTGCTTCAGCACCTACAGCACCCCCAG 
TACATCACTCTCCATGACACCTATGAGTCCCCCACATCCTACATCCTGATCTTGGAACTG 
ATGGATGATGGCCGGCTCTTAGACTACCTTATGAATCATGATGAACTGATGGAGGAAAAA 
GTAGCTTTCTATATCCGAGACATCATGGAGGCTCTGCAGTACCTTCACAACTGCAGGGTT 
GCACATTTGGACATAAAGCCTGAAAACCTGCTCATTGACCTACGGATTCCAGTGCCTCGA 
GTGAAGCTCATTGACTTGGAGGATGCTGTCCAGATCTCGGGTCACTTCCACATTCACCAC 
CTGCTGGGGAACCCTGAGTTTGCTGCCCCAGAAGTCATTCAAGGCATCCCCGTCTCCCTG 
GGGACAGACATCTGGAGCATCGGGGTTCTGACATATGTCATGCTGAGTGGGGTCTCCCCC 
TTCTTGGATGAGAGCAAAGAGGAGACATGTATCAACGTATGCAGGGTGGATTTCAGCTTC 
CCCCATGAATACTTCTGTGGTGTGAGCAATGCTGCCAGAGATTTCATCAATGTGATCTTA 
CAGGAAGATTTTCGGAGGCGGCCCACAGCAGCCACATGCTTGCAGCATCCATGGCTGCAG 
CCCCATAATGGCAGCTACTCTAAGATCCCCCTGGACACCTCCCGCCTAGCATGCTTCATA 
GAACGTCGCAAGCACCAGAATGATGTGCGGCCTATCCCCAATGTCAAGAGCTACATTGTC 
AACCGGGTGAACCAAGGGACGTAG 

SEQ ID NO: 45_5R72__8_2_H

CGCCGCTGTTTGTCCTCGCGCGGCCCCGTCCACTGCCCTGCGGTTGCTCTGCGGGCTGAA 
AAGTTTCTCCCGGTGCAGAATTCCGGGCTCAGCGACAGCCTGCGCCGAGTGTGCGCACCT 
GTCGGAGACCCGCCAGTCCGCCGGCCCCGGCTTTGTTCGTGCGGAACTGTAGTGGTGAGA 
AAAACTCCATGTCTGGGCACGCCTGGCTGATCTTCACCTCTTTCTTCTAGGACCTTCCTC
TGGGCTGTCACGTGTGAATATGTGTCTAGTGCATCCTTAACCTGAGGACTTCACCAGTTC 
GAAATTACAGTTTTCACCATCAAGTACCTTATCCTTTTTGGCCTGGTTTTCTTCCTCAAA 
CAGTGGAAACATTTTTAAAGTTGCTTTTGTTGCAGAGTTAAACAAATGGCTGATAGTGGC
TTAGATAAAAAATCCACAAAATGCCCCGACTGTTCATCTGCTTCTCAGAAAGATGTACTT 
TGTGTATGTTCCAGCAAAACAAGGGTTCCTCCAGTTTTGGTGGTGGAAATGTCACAGACA 
TGAAGCATTGGTAGTGCAGAATCTTTAATTTCACTGGAGAGAAAAAAAGAAAAAAATATG 
AACAGAGATATAACCTCCAGGAAAGATTTGCCCTCAAGAAGGTCAAATGTAGAGAGAAAA 
GCATCTCAGCAACAATGGGGTCGGGGCAACTTTACAGAAGGAAAAGTTCGTCACATAAGG 
ATTGAGAATGGAGCTGCTATTGAGGAAATGTATACCTTTGGAAGAATATTGGGAAAAGGG 
AGCTTTGGAATAGTCATTGAAGCGACAGACAAGGAAACAGAAACGAAGTGGGCAATTAAA 
AAAGTGAACAAAGAAAAGGCTGGAAGCTCTGCTGTGAAGTTACTTGAACGAGAGGTGAAC 
ATTCTGAAAAGTGTAAAACATGAACACATCATACATCTGGAACAAGTATTTGAAACGCCA 
AAGAAAATGTACCTTGTGATGGAGCTTTGTGAGGATGGAGAACTCAAAGAAATTCTGGAT 
AGGAAAGGGCATTTCTCAGAGAATGAGACAAGGTGGATCATTCAAAGTCTCGCATCAGCT 
ATAGCATATCTTCACAATAATGATATTGTACATAGAGATCTGAAACTGGAAAATATAATG 
GTTAAAAGCAGTCTTATTGATGATAACAATGAAATAAACTTAAACATAAAGGTGACTGAT 
TTTGGCTTAGCGGTGAAGAAGCAAAGTAGGAGTGAAGCCATGCTGCAGGCCACATGTGGG 
ACTCCTATCTATATGGCCCCTGAAGTTATCAGTGCCCACGACTATAGCGAGCAGTGTGAG 
ATTTGGAGCATAGGAGTCGTAATGTACATGTTATTACGTGGAGAACCAGCGTTTTTGGCA 
AGCTCAGAAGCGAAGCTTTTTGAGTTAATAAGAAAAGGAGAACTACATTTTGAAAATGCA 
GTCTGGAATTCCATAAGTGACTGTGCTAAAAGTGTTTTGAAACAACTTATGAAAGTAGAT 
CCTGCTCACAGAATCACAGCTAAGGAACTACTAGATAAGCAGTGGTTAACAGGCAATAAA 
CTTTCTTCGGTGAGACCAACCAATGTATTAGAGATGATGAAGGAATGGAAAAATAACCCA 
GAAAGTGTTGAGGAAAACACAACAGAAGAGAAGAATAAGCCGTCCACTGAAGAAAAGTTG 
AAAAGTTACCAACCCTGGGGAAATGTCCCTGAGACCAATTACACTTCAGATGAAGAGGAG 
GAAAAACAGTCTACTGCTTATGAAAAGCAATTTCCTGCAACCAGTAAGGACAACTTTGAT 
ATGTGCAGTTCAAGTTTCACATCTAGCAAACTCCTTCCAGCTGAAATCAAGGGAGAAATG 
GAGAAAACCCCTGTGACTCCAAGCCAAGGAACAGCAACCAAGTACCCTGCTAAATCCGGC
GCCCTGTCCAGAACCAAAAAGAAACTCTAAGGTTCCCTCCAGTGTTGGACAGTACAAAAA 
CAAAGCTGCTCTTGTTAGCACTTTGATGAGGGGGTAGGAGGGGAAGAAGACAGCCCTATG 
CTGAGCTTGTAGCCTTTTAGCTCCACAGAGCCCCGCCATGTGTTTGCACCAGCTTAAAAT 
TGAAGCTGCTTATCTCGAAAGCAGCATAAGCTGCACATGGCATTAAAGGACAGCGACCAG 
TAGGCTTGGCAGTGGGCTGCAGTGGAAATCAACTCAAGATGTACACGAAGGTTTTTTAGG 
GGGGCAGATACCTTCAATTTAAGGCTGTGGGCACACTTGCTCATTTTTACTTCAAATTCT 
TATGTTTAGGCACAGCTATTTATAGGGGAAAACAAGAGGCCAAATATAGTAATGGAGGTG 
CCAAATAATTATGTGCACTTTGCACTAGAAGACTTTGTTAGAAAATTACTAATAAAGTTG 
CCATACGTATTACAGCAGAAGTGCTTCAGTCATTCACATGTGTTCGTGAGATTTTAGGTT 
GCTATAGATTGTTTAAGACAGCTTATTTTAAATGTAGAAAAATAGGAGATTTTGTAACTG 
CTTGCCATTAACTTGCTGCTAAATTCGGAATGTATTGATTAAATCAATAAAAAAGAGATG 

TTACTC 

SEQ ID NO: 46_SGK309_H

GGGTCCGCAGCGGGGGCTCACAGGCCGTCGTCACTCCCGTAGGTAGATGGGGCCGTCAGG 
GCAGGGCCGGGGGACACCGCTCCCTGTGGCTGGCGGATGCAGTGGCTAGCGGGGGGGCTT 
AAGGACGAAACCAACATGAGTGGGGGAGGGGAGCAGGCCGACATCCTGCCGGCCAACTAC 
GTGGTCAAGGATCGCTGGAAGGTGCTGAAAAAGATCGGGGGCGGGGGCTTTGGTGAGATC 
TACGAGGCCATGGACCTGCTGACCAGGGAGAATGTGGCCCTCAAGGTGGAGTCAGCCCAG 
CAGCCCAAGCAGGTCCTCAAGATGGAGGTGGCCGTGCTCAAGAAGTTGCAAGGTTCGGGC 
CTCGGGCAGGGGGATGGGAAGGAAGAGATGATGAAGCCAGGGGGTAAGAGAGGGAAGGAC 
CATGTGTGCAGGTTCATTGGCTGTGGCAGGAACGAGAAGTTTAACTATGTAGTGATGCAG

rTCCAGGGCCGGAACCTGGCCGACGTGCGCCGTAGCCAGCCGCGAGGCACCTTCACGCTG 

AGCACCACATTGCGGCTGGGCAAGGAGATCTTGGAGTCCATCGAGGCCATCCACTCTGTG 

GGGTTGCTGCACCGTGAGATCAAGGCTTCAAAGTTTGGCATGGGCAGGCTGCCGTGCACG 

TACAGGAAGTGCTATATGCTGGACTTCGGGCTGGCCCGGCAGTACACCAACACCACGGGG 

GATGTGCGGCCCCCTCGGAATGTGGCCGGGTTTCGAGGAACGGTTCGCTATGCCTCAGTC 

AATGCCCACAAGAACCGGGAGATGGGCCGCCACGACGACCTGTGGTCCCTCTTCTACATG 

CTGGTGGAGTTTGCAGTGGGCCAGCTGCCCTGGAGGAAGATCAAGGACAAGGAACAGGTA 

GGGATGATCAAGGAGAAGTATGAGCACCGGATGCTGCTGAAGCACATGCCGTCAGAGTTC 

CACCTCTTCCTGGACGACATTGCCAGCCTCGACTACTTCACCAAGCCCGACTACCAGTTG 

ATCATGTCAGTGTTTGAGAACAGCATGAAGGAGAGGGGCATTGCCGAGAATGAGGCCTTT 

GACTGGGAGAAGGCAGGCACCGATGCCCTCCTGTCCACGAGCACCTCTACCCCGCCCCCA 

GCAGAACACCCGGCAGACGGCAGCCATGTTTGGGGTGGTCAATGTGACGCCAGTGCCTGG 

GGACCTGCTCCGGGAGAACACCGCGGATGTGCTACAGGGAGAGCACCTGAGTGACCAGGA 

GAATGCACCCCCAATTCTGCCCGGGAGGGCCTCTGAGGGGCTGGGCCACAGTCCCCACCT 

TGTCCCCCACCCCGGGGGTCCTGAGGCTGAAGTCTGGGAGGAGACAGATGTCAACCGGAA 

CAAACTCCGGATCAACATCGGCAAAGTAACTGCCGCCAGGGCGAAGGGCGTGGGTGGCCT 

TTTCTCTCACCCCCGATTCCCAGCCTTGTGCCCCTGCCCTGTTCCTCCTAAGCACCCTGT 

CCCCCGCCAATCTCCCTGCTTGCCCGGCCTCTGTTTCCGGTCCCCTCCCCGGCACTAGCC 

TCGCTGTGTCTTCCATCATCATCATCCTCTGTCTCCTTCACACTGAGGAGACCATCCGCC 

SEQ ID NO: 47_AA234451_H

GGCGCGCCAGATATCACACGTGCCAAGGGGCTGGCTCAGCGGCGGCGGCGGCAGGAGGGG 

GAGCAGGTGCTGGCACAAGAGCAGCGGCTTGGGGGAGCCGGCAGCAGCAGTAACAGCAGC 

AGCAGCCGCCGCCGCCGCCGCCAGTAAACGCGGACCGTACCCCAGGGGACTACCCAGCCG 

GCCGGCCCTGGAAGCCGCGCTCGGGTCCCGCCGCAGTCGGCGGTGGGGGATGGGCAGGCA 

GTGGCGGTCCCGCGTGCGGAGGGTTAACCCCCGCCGGTCCCGGTCCTGAGCTGGACCAGA 

GCCCTCCTCCAGAAACCCCTGCGTCCGCCACGGCCCAGGTTAAATGGAAACCACCCTTGG 

GAACTGGATGCCTGTGTAGCTGTTCTACCATATCAGTGTATTGCAATGAGTGGGGGAGGA 

GAGCAGCTGGATATCCTGAGTGTTGGAATCCTAGTGAAAGAAAGATGGAAAGTGTTGAGA

AAGATTGGGGGTGGGGGCTTTGGAGAAATTTACGATGCCTTGGACATGCTCACCAGGGAA 

AATGTTGCACTGAAGGTGGAATCAGCTCAACAACCAAAACAAGTTCTGAAAATGGAAGTT 

GCTGTTTTGAAAAAGCTGCAAGGGAAAGACCATGTTTGTAGATTTATTGGCTGTGGGAGG 

AATGATCGATTCAACTATGTGGTCATGCAGTTGCAGGGTCGGAATCTGGCAGATCTTCGC 

CGTAGCCAGTCCCGAGGCACATTCACCATTAGTACCACTCTCCGGCTGGGTAGACAGATT 

TTGGAGTCTATTGAAAGCATTCATTCTGTGGGATCTTGNCATCGAGACATCAAACCGTCG 

AACTTCGCTATGGGTCGCTTTCCTAGTACATGTAGGAAATGTTACATGCTTGATTTTGGC 

TTGGCTCGACAATTTACCAATTCCTGTGGTGACGTCAGACCACCTCGAGCTGTGGCAGGT 

TTTCGAGGGACAGTTCGTTATGCATCAATGAACGCACATCGGAACAGGGAAATGGGAAGA 

CATGATGACCTTTGGTCCTTATTCTACATGTTGGTGGAGTTTGTGGTTGGTCAGCTGCCC 

TGGAGAAAAATAAAGGACAAGGAGCAAGTAGGCTCTATTAAGGAGAGATATGACCACAGG 

CTCATGTTGAAACATCTCCCTCCAGAATTCAGCATCTTTCTAGACCATATCTCTTCTTTG 

GATTATTTTACAAAAGCAGACTACGAGCTTCTTACATCCGTGTTTGACAATAGCATCAAG 

ACTTTTGGAGTAATTGAGAGTGAGGGTTTTGACTGGGAGAAGACTGGAAATGATGGCTGG 

GTAACAAGCACCAGTAGTTGTACGAGGCGTGAGTTGCACACTCGCTTGACCGCTGGTGCA 

ATTGGAATTGCCAATGGTAGTGGCATCGCTGGAGACTTGGTTCGAGAAAATACAGATGAG 

GTATTTGCAGATGAACAGGTTAGGGATGGAGAAAATGGCATGCCTGTTGGTGTGTGAGGA 

GATAAATTGCCTGGATGTCTGGGAGAGGGCGGTCCCCAGGAGAAGGATGTTTGGGAAGAG 

ATGGATGCCAACAAAAACAAGATAAAGCTTGGAATTTGTAAGGCTGCTACTGAAGAGGAG 

AACAGCCATGGCCAGGCAAATGGTCTTCTCAATGCTCCAAGCCTTGGGTCACCAATTCGT 
GTCCGCTCAGAGATTACTCAGCCAGACAGAGATATTCCACTGGTGCGAAAGTTACGTTCC
ATTCACAGCTTTGAGCTGGAAAAACGTCTGACCCTGGAGCCAAAGCCAGACACTGACAAG 
TTCCTTGAGACCTGGTATAAAATAGTGTATTTTTCTTTTTAAAGCTTCTAAGGTACCATT 
ATTATTGTTGTCATTGTTGTTATTATTATTGTATATTTCTGTTACATAAAGTCTTTCAAA 
TAAGAAATCCTTGCATTTTTGTAACACTGAGTCTATTCAGCTCCAATTTTCATCCATGTT 
TTTAATTATTATTATCCTGATTCTTAATTATTATAAATTCTATAGCATATCCTTTGGCTT 
TGGAAGCTGAGCAGTAAGAGCTGATGACTTCCTAACACTAGGTACAAGTTAAATGAACAT 
TTTTACAGTAACTTTGTTTAGAAAGTAATCTCTTCCACACAACAGTGTAGTGGTGGAGAG 
GGCATGATAAAGATGGCATTAGGCAGAGATGAGGGGAATAGATAAAGGAGGGGAAAAAGT 
AATTCATACACAAGGGACGGTGAGTTCAATTCACTTTAGTGAAGACCCTCTAGGAGTAAG 
ATACTGTGGGAAAACAGATACCAATAAGTATATCATGCTTGCCCTAGAGAGTTTGGAATC 
TACCTAGAGAGAAAGGAAGGTGAAACTTGAGAGATCTATATACATAGGTAAAGATTGTAG 
TGCATGGTTTTGAGGCACATTATCCCTACAACAAATTTTGATAAGAGAAGAC 

SEQ ID NO: 48_AA435956_H

ACTTTTACTATATTCTTTGAGATGACTGTTTTTGATTTAGAGGCGAAATGAGCACGTGGT 
GGCTCAAATCTCCTTATGGATAGTGTTTGTTGCTTCGAGGTTTTCATGTTTCAACTTTTG 
CGGGGCCTGGCGTACATCCACCACCAACACGTTGTTCACAGGGACCTGAAAGGTCAGAAC 
TTACTCATCAGTCACCTGGGAGAGCTCAAAGTGGGTGATTTTGGTCTTGCCCGGGCCAAG 
TCCATTCCCAGGCAGACATAGTGTTCAGAAGTCGTGAGCCTGTGGTAGGGGCCGGCTGAT 
GCTTTGCTGGGAGCCAGTGAATATTCGTCTGAGGTGGAGATATGGGGTGCAGGGTGCATG 
TTTATTGAAATGTTCCAGGGTCAACCTTTGTTTCCTGGGGTTTCCAACATCCTTGAACAG 
CTGGAGAAAATCTGGGAGGTGCTGGGAGTCCCTACAGAGGATACTTGGCCGGGAGTCTGG 
AAGCTACCTAACTACAATCCAGAATGGTTCCCACTGCCTACGCCTCGAAGCCTTCATGTT 
GTCTGGAACAGGCTGGGCAGGGTTCCTGAAGCTGAAGACCTGGCCTCCGAGATGCTAAAA 
GGCTTTCCCAGAGACCGCGTCTCCGCCCAGGAAGCACTTGTTCATGATTATTTCAGCGCG 
CTGCCATCTCAGCTGTACCAGCTTCCTGATGAGGAGTCTTTGTTTACAGTTTCAGGAGTG 
AGGCTAAAGCCAGAAATGTGTGACCTTTTGGCCTCGTACCAGAAAGGTCACCACCCAGGC 
CAGTTTAGCAAATGCTGGTGAAAAGAAAGGGCGAGATCACCAAGGTTGTTCGAGGGCTGT 
ATTTCTGCAGTTTCGGTTTTCATTTGGTTCAGGTTACTAAGAAGCTTCAAATGTAACTCC 
ATACTGAACAAGGGGCTTTATGTCCTCACGTATGACCTGGAATAGTTTAAATATGGTGTT 
CAAGGCAATAGTACATAATAGTGGAAGAAAATTCAGTGGAAGGTTATTGGTATTGTCATT 
TGCATAGAATTTAAGTGATTGATTTAAAAAAACTGGACATAAACTAAGTGTAAGAAG 

SEQ ID NO: 49_AA626859_H

AAATGGAGTTGCTGATGGAGTGATCAAAAGGGTATTATGGCAAACACTTCAAGGTGTTAA 
TTTCTGTGATATACATAAGTGTATTGAGAGAGATATAAAACCTGAAAATATTGTAATAAG 
TAAGCAAGGAATAATGAAGATTTGTGAGTTGGGGTTTGGACAAATTGTGATTGCAGGAGA 
TGCCTACACCGATTATGTAGGTACGAGATGGTACCGAGCTGCTGAACTTCTTGTGGGAGA 
TAGTGAGTATGGTTGTTGAGTGGATATATGGGGTATTGGTTGTGTTTTTGGAGAGCTCCT 
GAGAGGCCAGCGAGTGTGGGGTGGAAAATCAGATGTGGAGGAAGTTTATGTGATAATCAG 
AACAGTAGGAAAATTAATCCCAAGACATCAATCAATGTTTAAAAGTAACGGGTTTTTCGA 
TGGCATCAGTATACCTGAGCCAGAAGACATGGAAACTCTTGAGGAAAAGTTCTCAGATGT 
TCATCCTGTGGGTGTGAACTTCATGAAGGGGTGTCTGAAGATGAATGGAGATGACAGATT 
AACCTGTTCCCAACTCCTGGAGAGCTCCTACTTTGATTCTTTTCAAGAGGCCCAAATTAA 
AAGAAAAGCACGTAATGAAGGAAGAAACAGAAGACGCCAACAGAATCAACTGTTGCCTCT 
CATACCAGGAAGCCACATGTCCGGCACACGTGATGGAAGAAAACAAGTGGTCGAGTTAAA 
ATTTGATCACCTTCCAAACATTTAGGAAAATGTTCTTTCAAGTGCAAAGTAATTTAATAT 
GTACACATTTTGTACAAGTGAGATAGGAATTCTCAGTGTTTCAAATGGAAATGAGGCATA 
TGAAAATTAAGATGCCTTCTAGA.^TTGGTTTGCTCTGATCATTQCTGATTCCTTTCCCCA
TGGTTTTACAT 

SEQ ID NO: 50_AA061797_M

GAAAATAGCCGTGCGGGAAATCCGTATGCTGAAGTTGAAAGACCCAAACCTCGTGAACCl' 

CATCGAGGTGTTCAGAAGAAAGAGAAAGATGCATCTAGTTTTTGAGTACTGTGATCACAC 

ACTGTTAAACGAGCTGGAGAGAAACCCAAACGGAGTTTCTGATGGAGTGATTAAAAGTGT 

GCTATGGCAAACCCTTCAAGCCCTTAACTTCTGTCACAAGCACAATTGTATTCATCGGGA 

TGTAAAACCTGAAAACATCCTAATAACCAAGCAAGGGATGATAAAGATTTGTGACTTTGG 

ATTTGGACGAATTCTAATTCCAGGAGAGGGCTACACAGAGTATGTTGCCACCAGGTGGTA 

CCGAGCCCCCGAACTTCTGGTGGGAGAGAGGAAGTACGGTTGCTCTGTAGACGTGTGGGC 

CGTGGGGTGTGTTTTTGCAGAGCTCGTGAGGGGTCAGGCACTGTGGCCGGGAAAATCCGA 

CGTGGAGCAGCTTTAGCTGATCATCAGGACGTTGGGGAAGCTGATTGCAAGACACCAGTC 

TATCTTTAGGAGTAACCAGTTTTTCGGCGGCATCAGCATACCTGAACCAGAGGACATGGA 

GACTCTTGAAGAAAAATTCTCAAATGTTGAGCGTGTGGGTTTAAGTTTCATGAAGGGATG 

GGTGAAGATGAATCCTGATGAGAGGCTGACGTGTGCCCAGGTGCTGGACAGTGCCTACTT 

TGAGTCTTTTCAAGAGGATGAAATGAAAAGAAAAGCCGGCAGTGAGGGGAGAAGGCGAAG

GCGCCAGCAGAATCAACTGCTGCCTCTTATTGCTGGAAGCCACATGTCCCCCACACCTGA 

TGGAAGGAAACAAGTCGTCCAGTTAAAGTTCGATCATCTTCCAAACATTTAGGGGACTCA 

TCCTTCCCAGCACATCGTTTTAATATTGTGTACATAGGAATAAGAGGGGAATCCTCAGCA 

TCTCAAATACAGTGAGCGACGTGAAGACGAGGGCACCTGTAATCAGCACGGGGTGCTGCC 

CTGTGGTTTTTCCACGCCAGCTCCATCTCGTAAAACATTCTCTTTAAATGTTGCAGTATC 

AAAATGGCACATCCGAAAGAGATGCTTCCAGTTTCAGCAGAGCCGGGCTTCCTCAGGGAA 

TCGGTACTGTGCATCTGTGGACTTATGCTCCGAGCTAGGGAAAGATTTCGACGTAGCCGT 

CTCAGTTCAGCCGACGAGTGGTGTCCTGAAGCAGAGCGAGATCTGCTGGGTGGTGTTTGT 

GGAGGGGATGGCCCTGAGCCCTCTCACTGGAGTTTCTTGTGCGTGCAGCGAGGTCTTAGT 

TTAGACTAGATTTGTGTTATTGTGGCATGGCAATCGTGAAAGGTGGTCTAGGTTTACCCT 

TGACTCGAGAGCAGATGGTAGTCTCCTTGTCGTGAGGAGGTGACAAGTCTGCTTCTAAAA 

CGAACTAGAGAAAATTCCAAACGTGACCAGTTAGTGGACAGACTACAAGGAATCGACCAC

CATACCACAGTAACGCCCCTGGATCCCTGGCTGCCCACCCACTCTAAGGCTATCCTGGTT 

CACCATGGTTTCTCTTTCTTTTCTTCTTTTTTAATCTATTTGTACATATGAGAAAGAGGC 

AGAGGGGCGAGAGAAACCTCGTGTGTGAAAATCAAAAGACAAGCAGGAGGCCAGCCTAAG 

CTACATAGCAAGGCCTTTTCTCTACACCCATTCTCTAAGGTTGCTTAAACCCAAGTCCCT 

GCTGCTGATTGTATAAACTATGAATAAGTTCTACATATGTAGGACATATTGTTGTCATTG 

TTGAAATATCTAAGGATCTTGGTAGAAGCAGAAGTGTTCTAAATATTCTCCACACTGGTG 

AGTATCTTGGCATTTCATTTCTGACCTCATCACAGATGAACACATCAAAGGATGAGTATG 

TATCACTTTGCATCTTAGAATTCTACCTGTTTTAGCTGCGTTAAACCTTGTGAAAGGGCG 

GGGCCATAACTGAACCTGTGGAGTTCTTGCCTGTGTGCAGGAAACCCTCTGGTTTTGTCT 

CCAGCATGGAAGAAAACAGCTATAGTCACACCTACCTGAAAGTAGAAATTCAAAGTCACT 

GTCCTTGACTACATATGCAGTCCAAGGCCACGCTGGGCTACACTTCTCCAGGCATGAAGG 

TCCGTGTTTGTATCAAGGGGCAGGAAAGGAGAGTCCAAGGTCAAGGCCAGCCGAGGCTGC 

ATAGTGAGTTGAGGCTCTTCAGCAAAAGAAAAGCAAACTAATAGGAGTCGTTGAAGGTAG

CCACCGGCCATTTCTCTAAATATCATTCTGCTGAAAAGGGGGCTTAGTTTAGTTTAGAAT 

GCATTAATGTATGTAGAAGCTGGGCTATTTCAGATTATTTGAAATTGTAGCTATTGTTAA 

TTAGCACTTAATAACTAACTAGCATTATGGTAGTCTAAACTATTAGAGTTTACTACAAAG 

AGGTTTTGATTGAATTATATTAAACATATAATATGGATTTTAAAAATTTAAGATGTTTAA 

GAAAGCTATATAAAGATTAAACATTTTTGTGGCTGTATATTTGTGTATATACCTTGGTTG 

TTCTTTAAATTATTTTAATAAAAGCCAGAAACATT 

SEQ ID NO: 51_AA397553_H

ATGCCCAATTCAGAGAGACATGGGGGCAAGAAGGACGGGAGTGGAGGAGCTTCTGGAACT 

TTGCAGCCGTCATCGGGAGGCGGCAGCTCTAACAGCAGAGAGCGTCACCGCTTGGTATCG 

AAGCACAAGCGGCATAAGTCCAAACAGTCCAAAGACATGGGGTTGGTGACCCCCGAAGCA 

GCATCCCTGGGCACAGTTATCAAACCTTTGGTGGAGTATGATGATATCAGCTCTGATTCC 

GACACCTTCTCCGATGACATGGCCTTCAAACTAGACCGAAGGGAGAACGACGAACGTCGT 

GGATCAGATCGGAGCGACCGCCTGCACAAACATCGTGAGCACCAGCACAGGCGTTCCCGG 

GACTTACTAAAAGCTAAACAGACGGAAAAAGAAAAAAGCCAAGAAGTCTCCAGGAAGTCG 

GGATCGATGAAGGACCGGATATCGGGAAGTTGAAAGCGTTCGAATGAGGAGACTGATGAC 

TATGGGAAGGCGCAGGTAGCCAAAAGCAGCAGCAAGGAATCCAGGTCATCCAAGCTCCAC 

AAGGAGAAGACCAGGAAAGAACGGGAGCTGAAGTCTGGGCACAAAGACCGGAGTAAAAGT 

CATCGAAAAAGGGAAACACCCAAAAGTTACAAAACAGTGGACAGCCCAAAACGGAGATCC 

AGGAGCCCCCACAGGAAGTGGTCTGACAGCTCCAAACAAGATGATAGCCCCTCGGGAGCT 

TCTTATGGCCAAGATTATGACCTTAGTCCCTCACGATCTCATACCTCGAGCAATTATGAC 

TCCTACAAGAAAAGTCCTGGAAGTACGTCGAGAAGGCAGTCGGTCAGTCCCCCTTACAAG 

GAGCCTTCGGCCTACCAGTCCAGCACCCGGTCACCGAGCCCCTACAGTAGGCGACAGAGA 

TCTGTCAGTCCCTATAGCAGGAGACGGTCGTCCAGCTACGAAAGAAGTGGCTCTTACAGC 

GGGCGATCGCCCAGTCCCTATGGTCGAAGGCGGTCCAGCAGCCCTTTCCTGAGCAAGCGG 

TCTCTGAGTCGGAGTCCACTCCCCAGTAGGAAATCCATGAAGTGCAGAAGTAGAAGTGCT 

GCATATTCAAGACATTCATCTTCTCATAGTAAAAAGAAGAGATCCAGTTCACGCAGTCGT 

CATTCCAGTATCTCACCTGTCAGGCTTCCACTTAATTCCAGTCTGGGAGCTGAACTCAGT 

AGGAAAAAGAAGGAAAGAGCAGCTGCTGCTGCTGCAGCAAAGATGGATGGAAAGGAGTCC 

AAGGGTTCACCTGTATTTTTGCCTAGAAAAGAGAACAGTTCAGTAGAGGCTAAGGATTCA 

GGTTTGGAGTCTAAAAAGTTACCCAGAAGTGTAAAATTGGAAAAATCTGCCCCAGATACT 

GAACTGGTGAATGTAACACATCTAAACACAGAGGTAAAAAATTCTTCAGATACAGGGAAA 

GTAAAGTTGGATGAGAACTCCGAGAAGCATCTTGTTAAAGATTTGAAAGCACAGGGAACA 

AGAGACTCTAAACCCATAGCACTGAAAGAGGAGATTGTTACTCCAAAGGAGACAGAAAGA 

TCAGAAAAGGAGACCCCTCCACCTCTTCCCACAATTGCTTCTCCCCCACCCCCTCTACCA 

ACTACTACCCCTCCACGTCAGACACCCCCTTTGCCACCTTTGCCTCCAATACCAGCTCTT 

CCACAGCAACGACCTCTGGGTCCTTCTCAGCCAGCATTTAGTCAGGTTCCTGCTTCCAGT 

ACTTCAACTTTGCCCCCTTCTACTCACTCAAAGACATCTGCTGTGTCCTCTCAGGCAAAT 

TCTCAGCCCGCTGTACAGGTTTCTGTGAAGACTCAAGTATCTGTAACAGCTGCTATTCCA 

CACCTGAAAACTTCAACGTTGCCTCCTTTGCCCCTGCCACCCTTATTACGTGGAGGTGAT 

GACATGGATAGTCCAAAAGAAACTCTTCCTTCAAAACCTGTGAAGAAAGAGAAGGAACAG 

AGGACACGTCACTTACTCACAGACCTTCCTCTCCCTCCAGAGCTCCCTGGTGGAGATCTG 

TCTCCCCCAGACTCTCCAGAACCAAAGGCAATCACACCACCTCAGCAACCATATAAAAAG 

AGACCAAAAATTTGTTGTCCTCGTTATGGAGAAAGAAGACAAACAGAAAGCGACTGGGGG 

AAACGCTGTGTGGACAAGTTTGACATTATTGGGATTATTGGAGAAGGAACCTATGGCCAA 

GTATATAAAGCCAGGGACAAAGACACAGGAGAACTAGTGGCTCTGAAGAAGGTGAGACTA 

GACAATGAGAAAGAGGGCTTCCCAATCACAGCCATTCGTGAAATCAAAATCCTTCGTCAG 

TTAATGCACCGAAGTGTTGTTAACATGAAGGAAATTGTCACAGATAAACAAGATGCAGTG 

GATTTCAAGAAGGACAAAGGTGCCTTTTACCTTGTATTTGAGTATATGGACCATGACTTA 

ATGGGACTGCTAGAATCTGGTTTGGTGCACTTTTCTGAGGACCATATCAAGTCGTTCATG 

AAACAGCTAATGGAAGGATTGGAATACTGTCACAAAAAGAATTTCCTGCATCGGGATATT 

AAGTGTTCTAACATTTTGCTGAATAACAGTGGGCAAATCAAACTAGCAGATTTTGGACTT 

GCTCGGCTCTATAACTGTGAAGAGAGTCGCCCTTACACAAACAAAGTCATTACTTTGTGG 

TACCGACCTCCAGAACTAGTGCTAGGAGAGGAACGTTACACACCAGGCATAGATGTTTGG 

AGGTGTGGATGTATTCTTGGGGAAGTATTCACAAAGAAGCCTATTTTTCAAGGGAATCTG 

GAACTGGCTCAGCTAGAACTGATCAGCCGACTTTGTGGTAGCCCTTGTCCAGCTGTGTGG 

CCTGATGTTATCAAACTGCCCTACTTCAACACCATGAAACCGAAGAAGCAATATCGAAGG 
CGTCTACGAGAAGAATTCTCTTTCATTC rTTCTGCAGCACTTGATTTATTGGACCACATG

CTGACACTAGATCCTAGTAAGCGGTGCACAGCTGAACAGACCCTACAGAGCGACTTCCTT 

AAAGATGTCGAACTCAGCAAAATGGCTCCTCCAGACCTCCCCCACTGGCAGGATTGCCAT 

GAGTTGTGGAGTAAGAAACGGCGACGTCAGCGACAAAGTGGTGTTGTAGTCGAAGAGGGA 

CCTCCATCCAAAACTTCTCGAAAAGAAACTACCTCAGGGACAAGTACTGAGCCTGTGAAG 

AAGAGCAGCCCAGCACCACCTCAGCCTGGTCGTGGCAAGGTGGAGTCTGGGGGTGGGGAT 

GCAATAGGCCTTGCTGACATCACACAACAGCTGAATCAAAGTGAATTGGCAGTGTTATTA 

AACCTGCTGCAGAGCCAAACCGACCTGAGCATCCCTCAAATGGCACAGCTGCTTAACATC 

CAGTGCAACCCAGAGATGCAGGAGCAGCTGGAAGCCCTGAACCAATCCATCAGTGGCCTG 

ACGGAAGCTACTTCCCAGCAGCAGGACTCAGAGACCATGGCCCCAGAGGAGTCTTTGAAG 

GAAGCACCCTCTGCCCCAGTGATCCTGCCTTCAGCAGAACAGATGACCCTTGAAGCTTCA 

AGCACACCAGCTGACATGCAGAATATATTGGCAGTTCTCTTGAGTCAGCTGATGAAAACC 

CAAGAGCCAGCAGGCAGTGTGGAGGAAAACAACAGTGACAAGAACAGTGGGCCACAGGGG 

CCCCGAAGAACTCCCACAATGCCACAGGAGGAGGCAGCAGCATGTCCTCCTCACATTCTT 

CCACCAGAGAAGAGGCCCCCTGAGCCCCCCGGACCTCCACCGCCGCCACCTCCACCCCCT 

CTGGTTGAAGGCGATCTTTCCAGCGCCCCCCAGGAGTTGAACCCAGCCGTGACAGCCGCG 

TTGCTGGAACTTTTATCCCAGCCTGAAGGAGAGCGTCCTGGCCACCTGCCACATGAGCAG 

GAGGCCTTGAGACCAATGGAGTACTCCACCCGACCGGGTGGAAAGAGGACTTATGGAAAC 

ACTGATGGGCCTGAAAGAGGGTTCAGTGCCATTGACACTGATGAACGAAACTCTGGTCCA 

GCCTTGACAGAATCGTTGGTCCAGACCCTGGTGAAGAACAGGACCTTCTCAGGCTCTCTG 

AGCCAGCTTGGGGAGTGCAGCAGTTAGCAGGGCACAGGGTCAGTGCAGTTTGCAGGGGAG 

CAGGACCTCCGTTTTGGCAGGGTCCCCTTAGCGTTACACCCGGTGGTCGGGCAACCATTG 

CTGAAGGCTGAGGGAAGCAGCAATTCTGTGGTACATGCAGAGACCTAATTGCAAAACTAT

GGGGAGCTGGGGCCAGGAACCACTGGGGCCAGCAGGTCAGGAGCAGGCCTTCACTGGGGG 

GGCCCAACTCAGTCTTCTGCTTATGGAAAACTCTATCGGGGGCCTACAAGAGTCCCAGCA 

AGAGGGGGAAGAGGGAGAGGAGTTGGTTAGTAA

SEQ ID NO: 52_AA789239_H
TGAAAATGGAGATGTATGAAACCCTTGGAAAAGTGGGAGAGGGAAGTTACGGAACAGTCA 
TGAAATGTAAACATAAGAATACTGGGCAGATAGTGGCCATTAAGATATTTTATGAGAGAC
CAGAACAATCTGTCAACAAAATTGCGATGAGAGAAATAAAGTTTCTAAAGCAATTTCATC 
ACGAAAACCTGGTCAATCTGATTGAAGTTTTTAGACAGAAAAAGAAAATTCATTTGGTAT 
TTGAATTTATTGACCACACAGTATTAGATGAGTTACAACATTATTGTCATGGACTAGAGA 
GTAAGCGACTTAGAAAATACCTCTTCCAGATCCTTCGAGCAATTGACTATCTTCACAGTA 
ATAATGTAATCATTCATCGAGATATAAAACCTGAGAATATTTTAGTATCCCAGTCAGGAA 
TTACT7VAGCTCTGTGATTTTGGTTTTGCACGAACACTAGCAGCTCCTGGGGACATTTATA 
CGGACTATGTGGCCACACGCTGGTATAGAGCTCCCGAATTAGTATTAAAAGATACTTCTT 
ATGGAAAGTATGTGCCTGTGGATATCTGGGCTTTGGGCTGTATGATCATTGAGATGGCCA 
CTGGAAATCCCTATCTTCCTAGTAGTTCTGATTTGGATTTACTCCATAAAATTGTTTTGA 
AAGTGNGATTCATGCCAGAACTGAAAGCTAAATTACTGCAGGAAGCAAAAGTCAATTCAT 
TAATAAAGCCAAAAGAGAGTTCTAAAGAAAATGAACTCAGGAAAGATGAAAGAAAAACAG 
TTTATACCAATACACTGCTAAGTAGTTCAGTTTTGGGAAAGGAAATAGAAAAAGAGAAAA 
AGCCCAAGGAGATCAAAGTCAGAGTTATTAAAGTCAAAGGAGGAAGAGGAGATATCTCAG 
AACCAAAAAAGAAAGAGTATGAAGGTGGACTTGGTCAACAGGATGCAAATGAAAATGTTC
ATCCTATGTCTCCAGATACAAAACTTGTAACCATTGAACCACCAAACCCTATCAATCCCA 
GCACTAACTGTAATGGCTTGAAAGAAAATCCACATTGCGGAGGTTCTGTGACAATGCCAC 
CCATCAATCTAACTAACAGTAATTTGATGGCTGCAAATCTCAGTTCAAATCTCTTTCACC 
CCAGTGTGAGGTTAACTGAAAGAGCAAAAAAGAGACGCACTTCTTCACAATCTATTGGAC
AAGTTATGCCTAATAGCAGGCAAGAGGATCCAGGTCCTATTCAAAGCCAAATGGAGAAGG 
GTATATTTAATGAGCGAACAGGTCACAGTGACCAAATGGCAAATGAGAACAAAAGGAAGC 
TGAATTTTTCCAGATCTGACAGGAAAGAATTCCATTTTCCAGAATTGCCTGTCACAATAC
AGTCAAAAGATACAAAAGGAATGGAAGTTAAACAGATAAAAATGCTGAAGAGGGAGTCAA 
AGAAAACAGAGTCATCTAAGATACCAACTTTACTTAACGTGGATCAAAATCAAGAJvAAAC 
AAGAGTTTATTCCCTTATCTCTGCTGTCTGCCTGCTGTCCTATTTTCACAAATATTTGCT 
CTCAGCTAACTATCAGGGTGGAGATGGCCATTGCGAGGGGAAGAATTTGAAGAGAAACAG 
GTTTTTTTTCTGGTAGTGTCTTTTCTTTTACATAGTCCAAAAAATACAAGATGACAACTC 
TTCCCGTTTTATTTATCTACAATAGAAGTGTGATGTGAGTTGTTGTTAAGACAGCCATCC 
ATGTGCATGAGCATCATCCAGCTTTTTTTGTTAGCAAAACATTTACTGTTTTCTTTTCCC 
TTTTAAGACTCTGTTGATGTGATAATTTGATTTGGAATTATAAAGTCATCTCTTCTCTGC 
CTTGAA 

SEQ ID NO: 53_AA124976_M 

CTGGCAGATATAGTTCATGCTTGTTTACAAATTGATCCTGCTGAGAGGACATCATCTACT 

GATCTTTTGCGTCACGATTACTTTACTAGAGATGGATTTATTGAGAAATTCATACCAGAG 

CTGAGAGCTAAATTATTACAGGAAGCAAAGGTTAATTCATTTATAAAGCCAAAAGAGAAT 

TTTAAAGAAAATGAACCTGTGAGAGATGAGAAGAAATCAGTTTTTACCAACACCCTGCTC 

TATGGAAATCCATCACTTTATGGCAAGGAAGTGGACAGAGACAAAAGGGCCAAGGAGCTC 

AAAGTCAGAGTCATTAAGGCCAAAGGGGGCAAAGGAGATGTCCCAGACCAGAAGAAGCCA 

GAGTATGAAGGCGACCACCGCCAGCAGGGCACAGCTGATGACACACAGCCCTCATCACTG 

GACAAGAAGCCTTCTGTCTTGGAACTGACAAACCCTCTCAATCCCAGTGAGAATTCTGAC 

GGTGTCAAAGAAGACCCACACGCTGGGGGTTGTATGATAATGCCACCTATCAACCTGACA 

AGCAGTAATTTGTTGGCCGCAAATCTCAGTTCAAACCTTTCCCACCCCAATTCACGGTTA 

ACTGAAAGAACAAAAAAGAGACGCACTTCTTCACAAACTATTGGACAGACTTTGTCTAAT 

AGCAGACAAGAGGACACAGGTCCCACACAAGTCCAAACAGAGAAAGGTGCATTTAATGAG 

CGAACAGGTCAGAATGACCAAATATCGAGTGGGAACAAAAGAAAGCTGAATTTTCCCAAA 

TGCGACAGGAAAGAATTCCATTTCCCTGAACTGCCATTCACAGTGCAGGCGAAGGAGATG 

AAAGGGATGGAAGTTAAACAGATAAAAGTGCTGAAGAGAGAATCAAAGAAAACAGATTCA 

TCTAAAATACCAACTTTACTTAGTATGGACCCAAATCAAGAAAAACAAGAGGGTGGAGAT 

GGCGATTGTGAGGGGAAGAATTTGAAGAGGAACAGATTTTTTTTTTCCCGATAGTGCTTT 

GTCTTTTAAGTAATCTTAAAAATACAAGCTTGACAATTCCTTCCTTTTTATTTTATATAC 

ACTAGAATGTACATAGGTTGCTGCTAAGATAGCCACCCATCCCATCTGCATCAACATCAT 

CTATTTTTTTGGTTTTGCTAGCAAAATTTTCACAATTTTTCTCTATCTTCCAAAAACTGT 

TATTTTGATGCTGTGATTTGAAATTATAAAGTCACCTCCTCTGTCTGCTTCCTTCCTTGC 

CATGATTACTGAGTGGGTAGTCACATGATGTGCCCTGCTCGCACTGCTCTCAGACTGCTG 

AGACTCAAACCTCATAAGCCAGGGGTCTCCTGGGAAGCACTGGCCTGTTCAAGTGGATGC 

TCGATGAACCTTCTTATCTGTTGTCTTAGTAACCACTCGTTGCCATCACATGATGAAAGA 

CATTCTATTGTCCCCAGTGAAGCATTTATAGTACTTACATAACATGTTACAGTGATATGA 

TGTTCCTAGGTTAAACTCCTTGAGATGAAACTATTTCCTGCATTCTCTGACTCCCCTAGT 

CTAATAGTTCCTTCCATTTAGCCAGAAGAATTTCCTGAAGAAGCGATGCACAACCTGGGA 

AAGGTTTACTTTCTATCCTGGGCTGTTTTCTGTTGCTAAATAATATAGACTGGGTAGTTA 

GTTAACAT 

SEQ ID NO: 54_AA575635_M CCRK_M

AGCGCCTCAGGCCAGCTCAAGATAGCTGACTTTGGCCTGGCCCGGGTCTTCTCTCCGGAT 
GGTGGTCGCCTCTACACACATCAGGTGGCCACCAGGTGGTACCGAGCTCCTGAACTCCTG 
TATGGCGCTCGGCAGTATGACCAGGGCGTTGACCTATGGGCTGTGGGCTGCATCATGGGA 
GAGCTGTTGAATGGGTCCCCCCTGTTCCCGGGCGAAAACGACATTGAACAACTGTGCTGT 
GTGCTTCGCATCCTGGGTACCCCGAGTCCTCGAGTCTGGCCGGAGATCACAGAGCTGCCT 
GACTACAACAAGATCTCCTTCGAGGAGCAGGCACCAGTGCCCCTGGAGGAGGTGCTGCCT 
GATGCCTCTCCCCAGGCCTTGGACCTGCTGGGCCAGTTCCTCCTCTACCCTCCACGACAG 
CGTATTGCAGCCTCCCAGGCCCTTCTGCATCAGTACTTCTTCACAGCGCCTCTGCCTGCC

CATCCATCCGAGCTGCCAATTCCTCAGCGCCCAGGGGGACCTGCACCCAAGGCTCACCCA 

GGGCCCCCCCATGTCCACGACTTCCATGTGGATCGACCTATTGAGGAGTCACTGTTGAAC 

CCAGAACTGATTCGGCCCTTCATCCCAGAGGGGTGAGATGCTGGTCCAGGCCTTCCTGC7 

CGCCCTAGGAGCACCTCTTTCTGATTTGCCTCCATGGCCTCCCCACGGCTATATATACCA 

CACCTGGTCCTGCTCCTGAGTGTGCTTGAGGGCTGGGCTCTGGGAGGCAGAACCGTGAGA 

TGTTCATGGCAGCAGAGAAAGAGAGTCACGTGCTACAGAGAAAGCCTGCAGAAAGTGCTA 

GGTGTGTGCTTCTCCAGGGCCAGCCCTCAGTGGTGCCACCCGGCCTTAGAGATGATTGTC 

AGGCTGTGTCGGCTCTTCAAGGACATTGGTAGTACAGCAGCAGCTGGTGGAAGCACAGAG 

TATAAGGTGTCTTCATACTGGGGACACAGGTGGGAAGTCAGACATGTTTTAGTTTTGGTT 

CCAGTGGGTCAGGATTTGAGGTTCATATAAAAGCCCTGGGTGTTTCTGTCTAATTGCACG 

TTGTCTGTTGCTGTTAGGGAAAGGACAATGGTGGGCCTTGATTCACAGGGGTCAGGTACT 

CAGAAGGGGCGTCCTGTGAAGGGCATTTGGGTCCTCAGGCTTCCCATGCTATTCACGGGA 

CTTGAGTGCTCATTTGGGAGCGAGGGTCCAGAAGCTGAGGCCCAGGGATGGACAGTCCAG 

TTCCCGAAGGGGACTTGCCAGATGTCGGGTGGGTCAGTCAGTGAGCCTGAGGCTGCCTTG 

GAGATGCGGAAGCAGGCATTCCTGGAATGCAGTCAGTAAATAAATTGCAGTGTGACTCAG 

SEQ ID NO: 55_AA631990_H

GAACAAGAATAACAGAATAAGGAAGAAAATGTCATGATTACCTCAATAAGTACAGAGAAA 

TCTGGTCACACTCAGTATCGATTCATGATTACAACTCTTCAATACTATGGCGGCCGAGGA 

GGGAAGACGGGAGTTTGGCGACATTTCTCGGGCGAAGGGGCATTTGCTTTTGCGGAGATG 

CGGCATTCCAAAAGAACTCACTGTCCTGATTGGGATAGCAGAGAAAGCTGGGGACATGAA 

AGCTATCGTGGAAGTCACAAGCGGAAGAGGAGATCTCATAGTAGCACACAAGAGAACAGG 

CATTGTAAACCACATCAGCAGTTTAAAGAATCTGATTGTCATTATTTAGAAGCAAGGTCC 

TTGAATGAGCGAGATTATCGGGACCGGAGATACGTTGACGAATACAGGAATGACTACTGT

GAAGGATATGTTCCTAGACATTATCACAGAGACATTGAAAGCGGGTATCGAATCCACTGC 

AGTAAATCTTCAGTCCGCAGGAGGAGAAGCAGTGCTAAAAGGAAGCGCAATAGACACTGT 

TCAAGTCATCAGTCACGTTCGNATGAAATCGTGGACACTTTGGGTGAAGGAGCCTTTGGC

AAAGTTGTAGAGTGCATTGATCATGGCATGGATGGCATGCATGTAGCAGTGAAAATCGTA 

AAAAATGTAGGGCGTTACCGTGAAGCAGCTCGTTCAGAAATCCAAGTATTAGAGCACTTA

AATAGTACTGATCCCAATAGTGTCTTCCGATGTGTCCAGATGCTAGAATGGTTTGATCAT 

CATGGTCATGTTTGTATTGTGTTTGAACTACTGGGACTTAGTACTTACGATTTCATTAAA 

GAAAACAGCTTTCTGCCATTTCAAATTGACCACATCAGGCAGATGGCGTATCAGATCTGC 

CAGTCAATAAATTTTTTACATCATAATAAATTAACCCATACAGATCTGAAGCCTGAAAAT 

ATTTTGTTTGTGAAGTCTGACTATGTAGTCAAATATAATTCTAAAATGAAACGTGATGAA 

CGCACACTGAAAAACACAGATATCAAAGTTGTTGACTTTGGAAGTGCAACGTATGATGAT

GAACATCACAGTACTTTGGTGTCTACCCGGCACTACAGAGCTCCCGAGGTCATTTTGGCT 

TTAGGTTGGTCTCAGCCTTGTGATGTTTGGAGCATAGGTTGCATTCTTATTGAATATTAC 

CTTGGTTTCACAGTCTTTCAGACTCATGATAGTAAAGAGCACCTGGCAATGATGGAACGA 

ATATTAGGACCCATACCACAACACATGATTCAGAAAACAAGAAAACGCAAGTATTTTCAC 

CATAACCAGCTAGATTGGGATGAACACAGTTCTGCTGGTAGATATGTTAGGAGACGCTGC 

AAACCGTTGAAGGAATTTATGCTTTGTCATGATGAAGAACATGAGAAACTGTTTGACCTG 

GTTCGAAGAATGTTAGAATATGATCCAACTCAAAGAATTACCTTGGATGAAGCATTGCAG 

CATCCTTTCTTTGACTTATTAAAAAAGAAATGAAATGGGAATCAGTGGTCTTACTATATA 

CTTCTCTAGAAGAGATTACTTAAGACTGTGTCAGTCAACTAAACATTCTAATATTTTTGT 

AAACATTAAATTATTTTGTACAGTTAAGTGTAAATATTGTATGTTTTGTATCAATAGCAT 

AATTAACTTGTTAAGCAAGTATGGTCTTGATAATGCATTAGAAAAATTAAAATTAATTTT 

TCTTTTTGAAATTACCATTTTTAAATACCTTTGAAATATCCTTTGTGTCCAGTGATAAAT 

GTGATTGATCTTGCCTTTTGTACATGGAGGTCACCTCTGAAGTGATTTTTTTTGAGTAAA 

AGGAAATGTTGACTACTTTATATTCTTAAAGGAATATTCTTTATATACTTCAAATTTAGA 
ACTTAACTTTAAAAGTTTTTCTTCTGTAATTGTTGAACGGGTGATTATTATTAACTCTAG
ATAAGCAGGTACTAGAAACCAAAACTCAGAAAATGTTTACTGTTAGAATTCTATTAAAT1 , 
TTAAGTGTTGTATTCTTTTTCATTGGGTGATGTCAGGGTGATAACCAGACATTCATGGAA 
AGGCATGCAGTTTGTCCATTGTGACAGTTTGTTTAATAAAACCACATACACACTTTATT1' 
AAGATTAAAATCTAACTGGAAAGTCAGCTTGGAAAATGGACATTTCCAAGTATGTTTGGT
GAGTCACAGATATAAAAATAGAAATTCTGATGAGAGGTTTCAGTTTTTAATACCAAGTCC 
TTAGGAGTCTTAACATTGGCCAGCATCTGTTTATCAAATGACATAAATACGTAAACCTAT 
AAGAATTAAGTTTATTAATTAGGCAATTTATGTCTGTGATAATTCTTACGGGAGAAAGAG 
GATTTGATTGGAAAGCAGTTTGGGAAGAAAGTGCTGCTGAAATTTCCAGAATTTAATTGA 
TTGGTTACATAAACTTTTTGACTTCAAT 

SEQ ID NO: 56_AA557536_H

AGTAAGGCCCCGCGGGCGTCCTGGCCGCCATGTGCACCGTAGTGGACCCTCGCATTGTCC 

GGAGATACCTACTCAGGCGGCAGCTCGGGCAGGGGAGAACATTCCGGGAAATCACGCTGC 

TCCAGGTGAGTGGCCTGGGCCCTCCAGTCCAATCCCCTTGCCCAGGTACAGATCTCTCGA 

GACAGGAGAGAAACTGGCCTTCTTGGGCCCCAGAGCACAGCCCCTCCTGGCCTTCGAGGC 

GCCTCCGACTCTCTCCCCAGGAGTTTGGGGACCATCCCAACATCATCAGGCTCCTTGACG 

TGATCCGGGCAGAGAACGACAGGGACATTTACCTGGTGTTTGAGTTTATGGACAGTGACC 

TGAACGCAGTCATCCGGAAGGGCGGCCTGCTGCAGGACGTGCACGTGCGCTCCATGTTCT 

ACCAGCTCCTGCGGGCCAGGCGGTTCCTCCACTGGGGGCACGTTGTGCAGCGGGACCAGA 

AGCCGTCCAATGTGCTCCTGGATGCCAACTGCACAGTGAAGCTGTGTGACTTTGGCCTGG 

CCCGCTCCCTGGGCGACCTCCCTGAGGGGCCTGAGGACCAGGCCGTGACAGAGTACGTGG 

CCACACGCTGGTACCGAGCACCGGAGGTGCTGCTCTCTTCGCACCGCTACACCGCTTCCT 

GCCCCAGATACACCCTTGGGGTGGACATGTGGAGTCTGGGCTGTATCCTGGGGGAGATGG 

TGCGGGGGAGACCCCTGTTCCCCGGCACGTCCACCCTCCACCAGCTGGAGCTGATCCTGG 

AGACCATCCCACCGCGATCTGAGGAGXXXAGGCCACGACAGACGCTGGATGCCCTCCTAG 

CGCCAGACACCTCCCCAGAGGCCTTGGACGTCCTTAGGCGACTCCTGGTGTTGGCGCCGG 

ACAAGCGGTTAAGCGCGACCCAGGCACTGCAGCACCCCTACGTGCAGAGGTTCCACTGCC 

CCAGCGACGAGTGGGCACGAGAGGCAGATGTGCGGCCGCGGGCACACGAAGGGGTCGAGG 

TCTCTGTGCCTGAGTACCGCAGCCGCGTCTATCAGATGATCCTGGAGTGTGGAGGCAGGA 

GCGGCACCTCGAGAGAGAAGGGCCCGGAGGGTGTCTCCCCAAGCCAGGCACACCTGCACA 

AACCCAGAGCCGACCCTCAGCTGCCTTCTAGGACACCTGTGCAGGGTCCCAGACCCAGGG 

CCCAGAGCAGCCCAGGCCATGACCCTGCCGAGCACGAGTCCCCCCGTGCAGCCAAGAACG 

TTCCCAGGCAGAACTCCGCTCCCCTGCTCCAAACTGCTCTCCTAGGGAATGGGGA7=LAGGC 

CCCCTGGGGCGAAGGAAGCGCCCCCCTTGACACTCTCGCTGGTGAAGCCAAGCGGGAGGG 

GAGCTGCGCCCTCCCTGACCTCCCAGGCTGCGGCTCAGGTGGGCAACCAGGCCCTGATCC 

GGGGTGACTGGAACCGGGGCGGTGGGGTGAGGGTGGCCAGCGTACAACAGGTCCCTCCCC 

GGCTTCCTCGGGAGGGGCGGCCCGGGCGGAGGATGTTCAGCAGCTGTGCGTTGCAGGGTG 

CCCAGGGGGGTGCCAGGGGTTTGGTTGGAGGGTACTCCCAAGCCTAGGGGAGTGTCTGGG 

ACTCGGCACTGGGCCACCTGCCCCTGCTGGAGGGGCACCATGTGTGAGCCGCCCTACTCC 

CTTCAGCTGGCCCTCTGTTCCTGGCCCAGCNGCTTCCCCAGACCCGTCTGCAGTGTCCTG 

CACCCGTTAGCGCTCGCTGGTTTGGCTGGCCCGTTGAAGTTGCAGGGAGCTTGCGCGGGT 
CTCCTCGGGGGAGCAGATGAGGGCCCTGCCC 

SEQ ID NO: 57_N28606_H, MOK_H

ATGAAGAACTATAAAGCAATTGGCAAAATAGGAGAGGGAACGTTTTCTGAAGTTATGAAG 
ATGCAAAGCCTGAGAGATGGAAACTACTATGCATGTAAACAAATGAAGCAGCGCTTTGAA 
AGTATTGAGCAAGTCAACAACCTAGGAGAGATCCAAGCAGTGAGGCGCGTGAATCCGGAC 
CCAAACATTGTTATGTTGCATGAAGTGGTTTTTGACAGAAAATCTGGTTGTGTTGCACTA 
ATATGTGAACTTATGGAGATGAATATTTATGAGCTAATAGGAGGGAGAAGATACCCATTA 
TACAT3TCCACC3G3TGGTAC3GGGCITCC3GA3TGTCTCCTCACTGATGGGTTCTACACG

TA3AAGATGGAG3T3TGGAGG3G3GG3T3TGTGTTGTACGAGATCGCCAGTCTGGAGCCC 
GTGTTTGCTGGAGTAAATGAACTGGACGAAATGTCAAAAATC3ACGATGTCATCGGCACA
333G3TCAGAAGATC3T3A3CAAGTT3AAACA3T3GAGAG3TATGAATTTTGATTTTCC~
TTTAAAAAGGGATGAGGAATA3CTCTA3TAACAACCAATTTGTCCCCACAATGCCT3TCG 
CTCCTGCACGCAATGGTGGCCTATGATCG3GATGAGAGAATCGCCGCCCACCAGGCCCTG 
3AGCACCCCTACTTCCAAGAACAGAGGAAAACAGAGAAGCGGGCTCTGGGCAGCCACAGA
AAAGCTGGCTTTCCGGAGCACCCTGTGGCACCGGAACCACTCAGTAACAGCTGCCAGATT 
TCCAAGGAGGGCAGAAAGCAGAAACAGT333TAAAGCAAGAGGAGGACCGTCCCAAGAGA 
CGAGGACCGGCCTATGTCATGGAACTG333AAACTAAAGCTTTCGGGAGTGGTCAGACTG 
TCGTCTTACTCCAGCCCCA3GCTGCAGTCCGTGCTTGGATCTGGAACAAATGGAAGAGTG 
CCGGTGCTGAGACCCTTGAAGTGCATCCCTGCGAGCAAGAAGACAGATCCGCAGAAGGAC 
CTTAAGCCTGCCCCGGAGCAGTGTCGCCTGCCCACCATAGTGCGGAAAGGCGGAAGATAA 



SEQ ID NO: 58_AB023153_H, ICK_K

ATGAATAGATACACAACAATCAGGCAGCTCGGGGATGGAACCTACGGTTCCGTCCTGCTG 

GGAAGAAGCATTGAGTCTGGGGAGCTGATCGCTATTAAAAAAATGAAAAGAAAATTTTAT

TCCTGGGAGGAATGCATGAACCAACGGGAGGTTAAGTCTTTAAAGAAGCTCAACCATGCC 

AATGTAGTCAAATTAAAAGAAGTTATCAGGGAAAATGATCATCTTTATTTTATCTTCGAG 

TACATGAAGGAAAATCTTTACCAGCTCATTAAAGAGAGAAATAAGTTGTTTCCTGAGTCT 

GCTATAAGGAATATCATGTATCAGATATTACAAGGACTCGCATTTATTCACAAACTCGGC 

TTCTTTCATCGAGACTTAAAGCCTGAGAACCTCCTCTGCATGGGACCAGAACTTGTGAAA 

ATTGCAGACTTTGGTTTGGCCCGAGAAATACGATCAAAACCTCCATATACAGATTATGTA 

TCTACCAGATGGTACAGGGCTCCAGAAGTACTCCTGAGGTCTACC/ACTACAGCTCCCCC

ATTGACGTCTGGGCGGTGGGCTGCATCATGGCAGAAGTTTACACCCTCAGGCCACTCTTC 

CCTGGAGCCAGTGAAATTGACACAATATTCAAAATTTGCCAAGTGCTGGGGACACCAAAA

AAGACTGACTGGCCTGAAGGCTATCAACTTTCAAGTGCAATGAACTTCCGTTGGCCACAG 

TGTGTACCCAATAACTTAAAGACCTTGATTC3CAATGCTAGCAGTGAAGCAGTCCAGCTC 

CTGAGAGACATGCTTCAGTGGGATCCCAAGAAACGACCAACAGCTAGTCAGGCACTTCGA 

TATCCTTACTTCCAAGTTGGACACCCACTAGGCAGCACCACACAAAACCTTCAGGATTCA 

GAAAAACCACAGAAAGGCATCCTGGAAAGGGCAGGCCCACCTCCTTATATTAAGCCAGTC

CCACCTGCCCAGCCACCAGCCAAGCCACACACACGAATTTCTTCACGACAGCATCAAGC3 

AGCCAGCCCCCTCTGCATCTCACGTACCCCTACAAAGCAGAGGTCTCCAGGACAGATCA3 

CCAAGCCATCTCCAGGAGGACAAGCCAAGCCCGTTGCTTTTCCCATCCCTCCACAACAAG 

CATCCACAGTCGAAAATCACAGCTGGCCTGGAGCACAAAAATGGTGAGATAAAGCCAAAG 

AGTAGGAGAAGGTGGGGTCTTATTTCCAGGTCAACAAAGGATTCAGATGATTGGGCTGAC 

TTGGATGACTTGGATTTCAGTCCATCCCTCAGCAGGATTGACCTGAAAAACAAGAAAAGA

CAGAGTGATGACACTCTCTGCAGGTTTGAGAGTGTTTTGGACCTGAAGCCCTCTGAGCCT 

GTGGGCACAGGAAACAGTGCCCCCACCCAGACGTCATATCAGCGGCGAGACACGCCCACC 

CTGAGATCTGCAGCCAAGCAGCACTATTTGAAGCACTCTCGATACTTGCCTG3GATCAG7

ATAAGAAATGGCATACTCTCGAATCCAGG3AAGGAATTTATT33ACCTAATC3ATGGTC7 

AGTTCTGGCTTGT3TGGAAAATCTTCAGGGACAATGTCAGTAATCAGCAAAGTAAATTCA

GTTGGTTCCAGCT3TACAAGTT3TAGTGGACTGACTGGAAACTATGTCCCTTCCTTTCT3

AAAAAAGAAATCGGTTCTGCTATGCAGAGGGTACACCTAGCAC3TATT33AGACCCTTCC

O ~T A t-"-" ^ ^r^n ^ - j ^ > *-r- -~- » » — ^ ^ Ti -~> -p « .-i ^ r. ^ r T r r,'^"T r "/"'^ * ^ * > ^ 
SEQ ID NO: 59_AA6 j9 94C_M

AGCAGCAACAATGGTGGCATGAGTGCAGAGGAGGAGATAGGG
AGAGGACCAAGCTTGGCTACAAGGGACTGGAGAGATGAGACT 
CAGCAAGGCATAGACCCAGGAGCAGTGAGCCCTGAGCCTGGQ. 

.'""I fT"* 7\ ."^ A 7\ r A 7i PTO Ti T\ ^ r~* r r^ ~r, onnfnn rp /-i r-n rn/-im,i ^rr>,T ^ , 




CTAGATGACAGCGCAGCACCCCCAGCCCCTTTTGAACACCGGGTAGTGAGCATCAAAGAT 

ACCCTGATCTCAGCAGGCTACACGGTATCCCAACATGAAGTCTTAGGAGGGGGTCGGTTT 

GGCCAGGTGCACAGGTGTACAGAGAGGTCTACAGGCCTTGCACTGGCAGCCAAGATCATC 

AAAGTGAAGAACGTAAAGGACCGGGAGGATGTGAAGAATGAGGTCAACATCATGAACCAG 

CTCAGCCACGTAAACTTGATCCAACTTTATGATGCGTTTGAGAGCAAGAACAGCTTCACT 

CTGATCATGGAGTATGTGGATGGAGGCGAACTCTTTGACCGGATCACGGATGAGAAGTAC 

CACCTCACTGAGTTGGATGTGGTCTTGTTCACGAGGCAGATCTGTGAGGGTGTGCATTAC 

CTGCATCAGCACTATATCGTGCACGTGGACCTCAAGGGTGAGAACATATTGTGTGTCAGG 

CAGACAGGGCATCAAATTAAGATCATTGACTTTGGGCTGGCTAGAAGATACAAGCCTCGG 

GAGAAGCTAAAGGTGAACTTTGGTACTGCGGAGTTCCTGGCCCCAGAAGTTGTTAAGTAT 

GAGTTTGTGTCATTTCCAACAGACATGTGGAGTGTGGGAGTTATCACCTACATGCTACTC

AGTGGTTTGTGCCCATTTCTAGGGGAGAGAGATGGAGAGAGCATGAATTTTATTGTGAAC 

TGCAGCTGGGATTTGGATGCTGATACCTTGAAAGGGGTGTCGGAGGAAGCCAAGGAGTTT 

GTTTCCCGGTTACTGGTCAAAGAGAAGAGCTGTAGGATGAGCGGCACACAGTGGCTGAAA 

GACGAGTGGTTAAATCACGTGGCTGCCAAAGCGTCGGGGTCCAACGTTGGCGTGAGATCC 

CAACAACTGCTGCAGAAATATATGGCTCAGAGTAAATGGAAGAAACATTTGGACGTGGTG 

GCTGCAGTCAAC7VGGCTACGGAAATTTCCAACGTGTGCCTAATCTTCAACTCTGGTGTTC 

CAGTGGGCCTGGGAATTCTTGAGGCAACACGAAGTGGTAATATGAAGAGATTACTCAAGA 

TTTTATGTAGATTGGCGCTTTGCTATTATTGATTTTTCTTATTTTGGAAAGAATGATGGA 

AGGAAGCAAGAAAGAAAGAAAAGAAGAAAAGGGGGAAGAAAAGGAAAAGGCAGAAAGCAA

GGAAACAGGCTACGTTGTTGCTCTTCTTGTAGGTGAAAGTGTTTTTATTAAAAGCGGTAG 

GAATGTTTTTCTGCCTCGTAAGGTCAGCAGGTCTCATATGCTGCTTGCTACCGCGCACCC 

TTCCTTTTGGTAATAAGAGCAGGCACGCTCAGGATGGGCAGGGAAATCCTACTTGGCTTT 

TGGTCAAATTTGAATTGTAAACTTGTCATGATTAAAGAAGCCAGTAGGGAGGGAGGTATG

GAAGAGGGAGGAATTAGGTGCAACAGTGGGGGATGAATTTGACCGAAACATTGTATAAAA 
TTCTTAAAGAATTAATAAAATATATTTTTAAAGGAG 

SEQ ID NO: 60_AA460132_H

GGAACCTCAGGCTTCAGAGAGCCGAAAAGTTGGGAGGCGTAACCACTTACAGGCCGGAAG

TGTCCGGGGTGGACGCATTCGGGTAGCCGAAGAAGTCCCAGGATTGCCGAAGAAGTCCCA 

GGATTTCCGAAGCGAGGCGAAGCATGGGGACAGTTTTCAGAGACAGCTGATCGGTTGGAG 

CTGTTGCGCCGAGCAGTGATGGCGGCGGGGAGAGGTAGTAGGGCGGCGGATGGGGAGGAG 

CCCGCCCCGGAGGCTGAGGCTCTGGGCGCAGCCCGGGAGCGGAGCAGGCGC7TCTTGAGC 

GGCCTGGAGCTGGTGAAGCAGGGTGGCGAGGCGCGCGTGTTCCGTGGCCGCTTCCAGGGC 

CGCGCGGCGGTGATCAAGCACCGCTTCCCCAAGGGCTACCGGCACCCGGCGCTGGAGGCG 

CGGCTTGGCAGACGGCGGACGGTGGAGGAGGCCCGGGCGCTCCTCCGCTGTCGCGGCGGT 

GGAATATCTGCCCCAGTTGTCTTTTTTGTGGACTATGCTTCCAACTGCTTATATATGGAA 

GAAATTGAAGGCTCAGTGACTGTTCGAGATTATATTCAGTCCACTATGGAGACTGAAAAA 

ACTCCCCAGGGTCTCTCCAACTTAGCCAAGACAATTGGGCAGGTTTTGGCTCGAATGCAC 

GATGAAGACCTCATTCATGGTGATCTCACCACCTCCAACATGCTCCTGAAACCCGCCCTG 

GAAGAGGTGAACATTGTGCTCATAGACTTTGGGGTGAGTTTCATTTCAGCACTTCCAGAG 

GATAAGGGAGTAGACCTCTATGTCCTGGAGAAGGCCTTCCTCAGTACCCATCCCAACAGT 

GAAACTGTGTTTGAAGCCTTTGTGAAGAGCTAGTCCAGCTCCTCCAAAAAGGCCAGGCCA 

GTGCTAAAAAAATTAGATGAAGTGCGCCTGAGAGGAAGAAAGAGGTCCATGGTTGGGTAG

AAGAATGTGTATGACAACCACACAGAGTGAAGGTCTTTTTTGAAAGTAAATTTGAAGAAA
TGCTACAAiGTATGAGATGAGATCTAAGTAAAGGTGTTAAGATATTTTTAAGT3GTATGT
ATGGT3TCATTATCATCT3GAGTTGACTCAAGAGGTTACTATGTGTCTAAGTCATGTTGT
AGGGAGAATTGGGTATTTAAAGTAAATTGAGGAGAGGGTTGTGCGAGATTGTGAGATGTA 
TATGTGAGATAGATGGGTGTGGGATTGAAGGAGATAATGAGAACATTATTGTGTTTTTAG 
TCCTTGTGAGACAAGGATGAAGTCTCAGTTGCTGATACTCGCTGAGCTTACTGGCCCTCT 

AACCCAGTGTTTTTTTTTGTTGTTGTTGTGTACATGTTATATTTATTTTGAAACCAGTTT 
AATGGGATACAACCAGCATTTTAAAAAATGAAATAGAATACAGGATGG 

SEQ ID NO: 61_SGK034_H

CAGAGAGAGAAGGTAAACCAAGGGAACATGCCAGGGCTTCAGAGCACCTTCCTAGCCATG 
GACACGGAGGAGGGGGTAGAGGTGGTGTGGAACGAGCTCCAGTTCGGAGACAGGAAGGCC 
TTCGCGGGGCACGAGGAGAAGATCCAGACGGTGTTGGAGCAGCTGGTGGTGGTGGACGAC 
CGGAACATCGTGAAGTTGGACAAGTAGTGGGTGGATACGTGTGAGGCGTGGGCGAGGGTC 
ATGTTCATCACAGAGTACGTGTCATCAGGCAGCGTCAAGCAATTCCTCAAAAAGACCAAG 
AAGAACCACAAGGCCATGAACGCGGGGGCGTGGAAGCGCTGGTGCACGCAGATCGTGTCT 
GGGGTCAGCTTGGTGCACGGCTGGAGCCCGGGAATCATCCAGGGGAAGCTGACGAGCGAG 
ACCATCTTCATTCAGCACAAGGGCGTCATCAAGATCGGCTGCGTGTGGCACGGAATCTTC 
TCCAATGCACTTGGAGATGATCTCCGAAGCGGCATGCGCGGTGAGGGAGAGGAAGTTGGG 
AACGTGCACTTCTTGCGCCCAGAGTATGGAGAGGTGGCCGATGGGAGCGGTGTGGAGATC 
TTCTCCTTTGGGATGTGTGCGCTGGAGATGGCTGTAGTGGAAATCCAGAGCAATGGGGAC 
ACCGGGGTCAGAGAGGAGGCCATTGCTGGCGCCAGGGAGTGGGTGAGTGACGGCAAGATG 
CGGGAGTTCATCCTTTGCTGGCTGGCCCGGGACCCTGCCGGCCGGGCGTGTGGCCACAGC 
CTCCTGTTGGAGGGCGTGCTCTTGGAGGTGCAGTGGGTGAAGCTGGTGGCAGGGCAGTGC 
TTCATCCAGCACCAGTACCTCATGCCTGAGAATGTGGTGGAGGAGAAGACCAAGGCGATG 
GACGTGGACGCGGTGTTGGGGGAGCTTCGGCGGGGCGGCAGGGGCGCGGTGCAGTGGCGG 
TACTCGGAAGTCTCCTTCATGGAGCTGGACAAATTCGTGGAGGATGTCAGGAATGGAATC 
TACGCACTGATGAACTTTGCAGCCACTCGACCCGTGGGGCTGCCGCGTGTGGTGGCCCCA 
CGCCCGGAGGAGGTCCAAAAGGCCAAGACCGCGACGCCAGAGCGCTTTGACTGTGAGACC
AGAAAGGTCATCCAGATGCAGTGCAACGTGGAGAGAAGCGAGGACAAGGCGCGGTGGCAT 
CTCAGTCTGCTTCTGGTGCTGGAAGACGGGGTGGACCGGGAGCTGACCTACGAGCTGGTC 
CCAAGGGACAGCGCCCAGGACCTCGCCTCGGAGCTCGTGCAGTATGGGTTGGTCCAGGAG 
GACGACCGGATGAAGCTGGGGGCCTTCCTGGAGAGCACGTTCGTGAAGTAGGGTGGGACC 
GAGGCCTGACGCGGAGCGGCAGCCCCAGGGGACCATGCCGGGGTGGTGCGGGGGCAGGGG 
ATGTTGGGGAGAGTGGAGCACGGTGGGGGTGCGGTGGTCCATGCGGCTGGGAGGACAAAG 
GGCCCGGTAGTGAAGGAACCGCCCGTGTGGTGAGAGTGGGGGTGACCGTGGGTTGGGCGC 
CGAGGGGTTGGGGGGTGGGTGTGGGGGAGGGGTTAGGCCTCCCAGGTCCTTAGGATCAGG 
GTTGGCGGGAGAAGCGGTTCGGATATGGTCCATTGTCGGCCCTGAGTTGGTAGGCAGGCT 
GCCTGGGTGGGGCCACTGCCTCCTCAGGATGCAGGAGGGTGCCGTGTAGGGAAGGCGAGC 
TGTGGGGGTTGGGGGTGAGGGTGAGCCCTGGACAGAGCTGTGCCCAGGGAAGTGGTCCAT 
GGGGTCTGGGAGAGGAGGGATCCCGTGCTGGGAGCATAGACGCACACAAGGAGGGTGCAC 
AGCAAGCCAGCGGTGACACACCTGCAGGTGTCAGGCATGGCACTGGGCACAACAGGGACC 
TGGCAGGAGAAACAGACCACAGAGAGGTCTGGAGTTGAGGCTGTTGTCAGCAAAGCCCCT 
GGTCCCACACAGCTCTGCCCTAGAGCCACCTCTTTGACCCTTTACCCACCCTGAGACCAG 
AACTTGCAGCCCCTCTGCAGATCTCCTCTGGCCACTGCAGCCCCTCCAATGGGCTTTTTC 
TCTCATGCATTCCCTGGCCTGGAGGCGTCAGGGACCCCACATCCTCCCTGCTCCTCAGAC 
TCACAGCCCCTCCATGTTACCTCCCGCACCTCCTCCCTGGGGCAGCTGCTCCCTGGGCCT 
CTGAGGATGTCAGCTCCTGGCTCCCTGCCTCTCTCCCACTCCACTCCTGGCTCAGTCTTA 
GAGATTTCTATGCCCTCATGGATTCTACCCCTGCCTTCCTGGCCTCTTGATTGTTGGCTT 
GCCTCTCCTGCAATTCGAAACTTAGTGAAATGGCCTTAAGCATTTTAAACTGTATGTA'I'A 
CATTAGCGGATTCATGCGTTTCTAAAGGCATTTCAAATGTCAACCAGGAAGGCACACGA3 
TGTATTAGTTTTATACTGCCGCTGTAAAATTTACCACAAACTTAGTGACTTAACACAAAT
TTATTGCAATTCTGTAGGCTGGAAGTCTGACTATGGGTCTCACTGGACTAGAATCAAGGC 
TGGGAGGCTGCCTTCCTTCCTGGAGGTTGTAGGGGAGAGTCTGTCTCCTGCTGGTTCAGG 
GTGCTGGCAGAATCGACATCGTTTCGGTGGCAGGGCCAAGGTCCCCACTTTCTTGCTGA Z 
TGTAAACTAAGGCCAGTTGGAGCTTGTAGAGGCTGCCTACATTCCTTGGCTGTTGGCCC r : 
CTCCTCCATCTTCAGAGCTAGCAGGTTCAGTCTGTGTCACGAACCATTTCTCTGGTTCCG 
TGCAGACAGGAAAGGTTGTCCCTAAGGACTCATGAGATTAGGTTGGGCCCAGCCAGATAA 
TACATGATAATCTCCCTCCTCAAGGTTTTTAATATTAAACACATCTGCAGGACACATTTT 
GCCATGTAAACTAACATTCACTGGTTCCAGGGATTAAGGAATGAACCTCTTTTGTTGGGG 

TACTAACGCATATCAAATAAATTAAAGGTACAAAATCGTGAATCTCAGTTATCTTAAATA 
TTCCAATACTATTTACAAAATTATTCAAATTCTCACGCCTTCCAACTCAAAATTAGCAAT 
CTAAAGTAATTTCCATATCCTAGATGGAAACCCTCATGCTAAACTGTCTGATTATGCATG 
GTTCTAAATGGTTTCAGTGGCAAATACATAACATTGTACTACTGATTAAACTGAACTTAA 
AAGC 

SEQ ID NO: 62_AA103218_M SGK034_M

CCACGCGTCCGCACCAGAGTATGGCGAAGTCAATGATGGGACTGGCTTTGTGGACATCTT 

CTCCTTCGGGATGTGTGCACTGGAGATGGCTGTACTCGAGATCCAAGCCAACGGGGATAC 

CAGAGTCACAGAAGAGGCCATCGCTCGAGCCAGGCACTCACTGAGTGACCCCAACATGCG 

GGAATTCATCCTCTCCTGCCTGGCCCGGGACCCTGCCCGCCGACCCTCAGCCCACAACCT 

CCTCTTCCACCGAGTGCTCTTTGAGGTGCACTCGCTGAAGCTGCTGGCAGCTCACTGCTT 

CATCCAGCACCAGTACCTCATGCCTGAGAATGTGGTAGAGGAAAAGACCAAGGCCATGGA 

CCTCCATGCAGTTTTGGCTGAGATGCCGCAGCCCCATGGACCCCCAATGCAGTGGCGGTA 

CTCAGAGGTCTCCTTCTTGGAGCTGGACAAATTCCTAGAGGATGTCAGGAACGGGATCTA 

TCCACTGATGAACTTTGCGGCTGCTCGGCCCTTGGGGCTTCCCCGTGTGTTGGCCCCACC 

CCCAGAGGAAGCCCAAAAGGCCAAAACTCCAACGCCAGAACCCTTTGACTCGGAGACCAG 

GAAGGTGGTCCAGATGCAGTGCAACCTGGAAAGAAGCGAGGACAAGGCTCGGTGGCACCT 

TACTCTGCTCTTGGTGCTTGAGGACCGGCTACATCGGCAGCTGACCTATGATCTGCTCCC 

AACGGACAGTGCCCAGGACCTCGCTGCTGAACTAGTGCATTATGGCTTCCTGCACGAGGA 

TGACAGGACAAAGCTAGCAGCCTTTCTGGAGACCACTTTTCTCAAGTACCGAGGGACGCA 

AGCGTGACCTTCCCAGTCCTGACGGCCCAGCAGAGATACAGGGGCTCAGGGTTGTCCACT 

TGGCAAAGAGCCCCCACACTGCTCAAAGCTGCCTTCTGCCTGTGTTCCCTGGAACTGAAC 

ACAGGCCCTGCTAGTGAAGACACCCCCACCCCCCAGCTTTCTGCAGCAGTGTGGGACCCT 

GGGGTGGTGATGGAGCCCTGAGCCTGGACGAGAGTGGATACAGGTCAGTTAGGGGAACCG 

CTCCATCTGGTACTAGACAACAGCCATGCCTTCAGGTGGCATAGAAACCTAGGGAAGGAG 

CCTGAACTCAGGTGTCACAGTGCTGGGCATCAGGCAGACCAGACCTGACCTGATTGGAGA 

ACTGTAGACTAGATAGCTTGGAGTTGAACCCATGGCCAGGGAATTCCTTGGTCCTGCTCA 

GACCAGTCCTGATCCCTTGCAGACCTGCCTTGAGCCCTCTTTCTGATCTTCCACACTCTT 

GAGACCAGGACCTGTGTCCTCCCCAAAGCCCTTGGGAAGGATCTTTCTATTCATCATCCC 

TCTGGCCTAGGGGCTCAGGGGTCAGGCATCCTCCACATTCCCTCCCTGGGGAAGTTGTGT 

GTTTGAGTTGAGGATGTGGGTTCCTGGCTCCCTCTTTCTCCCCAGCCCAACTTGTCTCTT 

TCTTACTGGTTTCAAAGTCCTGATGAACGCTTCCCCTCAGAGCCACCCTGGTTTCCTTGG 

TTCTTGAACTGCCTCTCTCCCAACTTCAAACCAGGTCTTAAACGTTTTTTAAATGCATAT 

ATAAATGTAATGCAGTCACGGTCCTTTTTAAACACTTTGTGTATGAAACCAGGAAAGCTC 

ACTATTGTATTAGGAATAGTTCCACATTGCTGCTGTTAACAGATATCATAAACCCAGTGG 

TTTGAGACGACACACACACACACACACACACACACACACAGAGAGAGAGAGAGTTCTGTA 

CATCAAGTGTGATCCAGGCTCTCAGTAGATTAATACCCAGGCTAAGTTCCTTTCTGGAAG 

CTGGGACTTACCTCCTGCTCCTTCAAGCTATTGGCAGAACTCACTTCCCTGCAATGGTAA 

GGCAGAAATCCCTATTTTCTCAACAGCTGCCAACTAAGAACCCCTCTCAGCTTCTAGAGG 
CCACCAACTTTTCTTAGTTCTTCTTTCTCCTCCCCTCAAGACCAGCAGCGTCAAGTTGAAT
CZTTGZ'CCTGGG rTAGGTGACTGGCTTGCCAGTGG r I , GGGA.^GAGTTGGGGGGTTTTGTGA 

GTAGGTTGGACCCAGCAGGATAACCGAGGATGATCCCCTTCTCAGGGTCTATAGATGAAC 

CACACCTGCGCAGTTCCTTCTGCTGTCATCCTGGGCTTTGGTGCTTGGAGAACAGCCGTG 

GGCGGTGGGTGTTGTTACTGTGGTACCTACCATGCCATCTTAACCGAAACCAAGACCTAA 
AATAAAACAGATTTGTCATGGGACATCTAATAAATTAAATGAAGTCTG 

SEQ ID NO: 63_NEK7_H, N34132_H 

CAGGAATCCGAGCCCGCTCGCCTCTCTGCAGCGAACCGACCATGTCTGGCGGCGGCGCAG 
AGAAGGAGAGCAGCACTGCCGGTTCCCTGTTCCTGTCGCCGCCGGCTCCTGCCCCCAAGA 
ACGGCTCCAGCTCCGATTCCTCCGTGGGGGAGAAACTGGGAGCCGCGGCCGCCGACGCTG 
TGACCGGCAGGACCGAGGAGTACAGGCGCCGCCGGGACACTATGGACAAGGACAGCCGTG 
GGGCGGCCGCGACCACTACCACCACTGAGCACCGGTTCTTCCGCCGGAGCGTCATCTGCG 
ACTGCAATGCCAGTGGAGTGGAGGTTCCGGGCGTTCCTCTTTCGCTGCGCCAGGCCAGCA 
TCGGCGGGGCTGTCGGGGAGAGTGCTCCACCGGAGGGGCAGCGGGAAGAGACCGTGACCG 
CGACCGCCACTTCGGAGGTAGCCCAGCAGGCTCCAGGGGCTGCCGCCCGTGGGGAAGAGG 
GGGTGGCGGGGCCTGCGGGGTGGAGTGTGGGCAGCAGTACCAGCAAAGAGCGCCGAGTGT 
CGCAGCCTAGGGTTGTGGGGAGCAAAGAGGAGCCGCCGCCGGCGAGAAGTGGCAGCGGCG 
GCGGCAGGGGGAAGGAGGCACAGGAGGAACGGAGCGAGCAGCAGGATGATATGGAAGAGC 
TGGAGACCAAGGGGGTGGGAATGTGTAAGGATGGCCGGTTTCTCAAGTTTGACATCGAAA 
TGGGCAGAGGGTCCTTTAAGACGGTGTACAAAGGTCTGGACACTGAAACCACCGTGGAAG 
TCGCCTGGTGTGAACTGCAGGATCGAAAATTAAGAAAGTCTGAGAGGCAGAGATTTAAAG 
AAGAAGCTGAAATGTTAAAAGGTCTTCAGCATCGCAATATTGTTAGATTTTATGATTCCT 
GGGAATCCACAGTAAAAGGAAAGAAGTGCATTGTTTTGGTGACTGAACTTATGACGTCTG 
GAACACTTAAAACGTATGTGAAAAGGTTTAAAGTGATGAAGATCAAAGTTCTAAGAAGCT 
GGTGCCGTCAGATCCTTAAAGGTCTTCAGTTTCTTCATACTCGAACTCCACTTATCATTC 
AGGGCGATCTTAAATGTGACAACATGTTTATCACCGGGCCTACTGGCTCAGTCAAGATTG 
GAGACCTCGGTCTGGCAACCCTGAAGCGGGCTTCTTTTGCCAAGAGTGTGATAGGTACCC
CAGAGTTCATGGCCCGTGAGATGTATGAGGAGAAATATGATGAATCCGTTGACGTTTATG 
CTTTTGGGATGTGCATGGTTGAGATGGCTACATCTGAATATCCTTACTCGGAGTGCCAAA 
ATGCTGCGCAGATGTAGGGTGGGGTGACGAGTGGGGTGAAGCCAGCCAGTTTTGACAAAG 
TAGCAATTCCTGAAGTGAAGGAAATTATTGAAGGATGCATACGACAAAACAAAGATGAAA 
GATATTCCATCAAAGACGTTTTGAACCATGCCTTCTTCCAAGAGGAAACAGGAGTACGGG 
TAGAATTAGCAGAAGAAGATGATGGAGAAAAAATAGCCATAAAATTATGGCTACGTATTG 
AAGATATTAAGAAATTAAAGGGAAAATACAAAGATAATGAAGGTATTGAGTTTTGTTTTG 
ATTTAGAGAGAGATGTGGGAGAAGATGTTGCACAAGAAATGGTAGAGTGTGGGTATGTCT 
GTGAAGGTGATCACAAGACCATGGCTAAAGCTATCAAAGACAGAGTATCATTAATTAAGA 
GGAAACGAGAGCAGGGGGAGTTGGTACGGGAGGAGGAAGAAAAGAAAAAGCAGGAAGAGA 
GCAGTCTCAAAGAGCAGGTAGAACAATCCAGTGCTTGCCAGACAGGAATCAAGCAGCTCC 
GTTGTGGTAGCAGCGGGATACCTACTGCTTCTACCACTTCAGCTTCAGTTTGTACACAAG 
TAGAACGTGAAGAAGGTGAGGCAGATCAACATGAAGAACTACAGTACGAGGAACCGAGTA 
TATGTGTGTTATGTGATGGGACGGTTGAGAGTGGTCAGGGATCCTGTGTGTTGAGAGAAT 
GTGGAGTGAGCAGGCAACAGAGAGTTTCATATGGGTTGCCAANNCATGAAGAGGCACATT 
GTAGAGGCACAGTCCCAGGGCATATACCTTCTACTGTCCAAGCAGAGTGTCAGCGCCATG 
GGGTATATCCAGGCTCAAGTGTGCAGCAGGGAATAGAGGAGACAGGGGGTGCTCAACAGA 
GAGTGCAGTATTCACTTTGACAGACATCAAGGTGCAGTGAGGGCACTACTGCACAGCCAG 
TGAGTCAGGCTCAAGCTCCACAAGTGTTGCGTGAAGTATCAGGTGGAAAACAGAGTAGTG 
AGGGAGTGTGTCAGGTTGCTGGTGCAGAGCGAGTTGCAGTAGGACAGGGCCAAGCTAGCC 
AGCCGACCACTTTGGCTTCCTCTGTAGACAGTGCACATTCAGATGTTGCTTCAGGTATGA 
GTGATGGCAATGAGAACGTCCCATCTTCCAGTGGAAGGCATGAAGGAAGAACTACAAAAC 
GGCATTACCGAAAATCTGTAAGGAGTCGCTCTCGACATGAAAA7VACTTCACGCCCAAAAT

TAAGAATTTTGAATGTTTCAAATAAAGGAGACGGAGTAGTAGAATGTCAATTAGAGACTC 

ATAATAGGAAAATGGTTACATTCAAATTTGACCTAGATGGTGACAACCCCGAGGAGATAG 

CAACAATTATGGTGAACAATGACTTTATTCTAGCAATAGAGAGAGAGTCGTTTGTGGATC 

AAGTGCGAGAAATTATTGAAAAAGCTGATGAAATGCTCAGTGAGGATGTCAGTGTGGAAC

CAGAGGGTGATCAGGGATTGGAGAGTCTACAAGGAAAGGATGACTATGGCTTTTCAGGTT 

CTCAGAAATTGGAAGGAGAGTTCAAACAACCAATTCGTGCGTCTTCCATGCCACAGCAAA 

TAGGCATTGCTAGCAGTTCTTTAAGTCAAGTTGTTCATTGTGCGGGAAGGCGGTTTATAG 

TGAGTCCTGTGCCAGAAAGCCGATTACGAGAATCAAAAGTTTTGCCCAGTGAAATAACAG 

ATACAGTTGCTGGCTGTACAGCTCAGAGGCCTGGAATGAAGTTGTCTCAGTCTGCATCAT 

CCCTTAGTCTACAACAGGCCTTTTCTGAACTTAGACGTGGGCAAATGAGAGAAGGACCCA 

ATACAGCACGTCCAAACTTTAGTCATACAGGACCAACATTTCCAGTAGTACCTCCTTTCT 

TAAGTAGCATTGCTGGAGTCCCAACCACAGCAGCAGCCACAGCACCAGTGCCTGGAACAA 

GCAGCCCTGCTAATGACATTTCCACATCAGTAATTCAGTCTGAGGTTACAGTGCGCACTG 

AAGAGGGGATTGCTGGAGTTGCCACCAGCACAGGTGTGGTAACTTCAGGTGGTCTCCCCA 

TACCACCTGTGTGTGAATCACCAGTACTTTCCAGCGTAGTTTCAAGTATCACAATACGTG 

CAGTTGTCTCAATATCTAGTACATCCCCGTCACTTCAAGTCCCCACATCCACATCTGAGA 

TCGTTGTTTCTAGTACAGCAGTGTATCCTTCAGTAACAGTTTCAGCAACTTCAGCCTCTG 

CAGGGGGCAGTACTGCTACCCCAGGTCCTAAGCCTCCAGCTGTAGTATCTCAGCAGGCAG 

GAGGGAGCACTACTGTGGGAGCCACATTAAGATCAGTTTCTACCACCACTTCATTCCCAA 

GCACAGGTTCACAGCTGTGCATTCAGCTTAGCAGCAGTACTTCTACTCGTACTTTAGCTG 

AAACCGTGGTAGTTAGCGCACAGTCACTAGATAAGACATCTCATAGCAGTACAACTGGAT 

TGGCTTTCTCCCTCTCTGCACCATCTTCCTCTTCCTCTGCTGGAGCAGGAGTGTCTAGTT 

ATATTTCTCAGCGTGGTGGGCTGCATCCTTTGGTGATTGGATCAGTGATAGGTTCTAGTC 

CTATTCTTCCCCAAGCAGCAGGACGTACTTCTACACCTTTATTAGCCGAAGTACCTAGTA 

TCCCACGCTTGGTACAGCCTGTTGCCAATGTGCGTGCTGTACAGCAGACAGTAATTGATA 

GTCAGCCTCAACCAGCTTTGCTTCCCAACCAGCCCCATACTCATTGTCCTGAAGTAGATT 

CTGATACACAACGCAAAGCTCCTGGAATTGATGACATAAAGACTGTAGAAGAAAAGCTGC 

GGTCTCTGTTCAGTGAACACAGGTCATCTGGAGGTCAGCATGGGTCTGTCTCACTGGAGA 

CGTCACTAGTGATAGAGAGCACTGTCACACCAGGCATCCCAACTACTGCTGTTGCACCAA 

GGAAACTCCTGACTTCTACCACAAGTACTTGCTTACCACCAAGCAATTTACCACTAGGAA 

CAGTTGCTTTGCCAGTTACACCAGTGGTCACACCTGGGCAAGTTTCTACCCCAGTCAGCA 

CTACTACATCAGGAGTGAAACCTGGAACTGCTCCCTCCAAGCCACCTCTAACTAAGGCTC 

CGGTGCTGCGAGTGGGTACTGAAGTTCCAGGAGGTAGTGTACCCAGGGAGCAGCTGGCAC 

CTTTTCGAGGACCTTCTCTAACCCAGTCCCAGCAACCTGTAGAGGATCTTGATGGTCAAT 

TGAG/LAGAACAGTTAGTGCAGAGATGATCACAGTGACTTCTGCGGTTGGTCCTGTGTCGA 

TGGCGGCTCCAACAGCAATCAGAGAAGCAGGAACACAGCGTGAGAAGGGTGTTTCTGAAG 

TCAAAGAAGGGGCTGTCGTAGCT^GTAGTTCAGGAGCTGGTGTTTTTTVAGATGGGAGGAT 

TTCAGGTTTCTGTTGCAGGAGACGGTGCCGAGAAAGAGGGTAAAAATAAGTCAGAAGATG 

CT^AAGTCTGTTCATTTTGAATCCAGCACCTCAGAGTCGTCAGTGGTATGAAGTAGTAGTC 

CAGAGAGTACCTTGGTGAAACCAGAGCCGAATGGCATAAGGATCGGTGGTATGTCTTCAG 

ATGTGCCAGAGAGTGCCCACAAAACTACTGCCTCAGAGGCAAAGTCAGACACTGGGCAGC 

CTACCAAGGTTGGAGGTTTTGAGGTGACAACTACAGCAAACAAAGTGGGTGGTTTCTGTG 

TATCAAAAACTGAGGACAAGATCACTGACACAAAGAAAGAAGGACCAGTGGCATCTCCTC 

CTTTTATGGATTTGGAACAAGCTGTTCTTCCTGCTGTGATACCAAAGAAAGAGAAGCCTG 

AACTGTCAGAGCCTTCACATGTAAATGGGCCGTCTTCTGACCCGGAGGCCGCTTTTTTAA 

GTAGGGATGTGGATGATGGTTCCGGTAGTCCACACTCGCCCCATCAGCTGAGCTCAAAGA 

GCCTTCCTAGCCAGAATCTAAGTCAAAGCCTTAGTAATTCATTTAACTCCTCTTACATGA 

GTAGCGACAATGAGTCAGATATCGAAGATGAAGACTTAAAGTTAGAGCTGCGACGACTAC 

GAGATAAACATCTCAAAGAGATTCAGGACCTGCAGAGTCGCCAGAAGCATGAAATTGAAT 
CTTTGTATACCAAACTGGGCAAGGTGCCCCCTGCTGTTATTATTCCCCCAGCTGCTCCCC

TTTCAGGGAGAAGACGACGACCCACTAAAAGCAAAGGCAGCAAATCTAGTCGAAGCAGTT 

CCTTGGGGAATAAAAGCCCCCAGCTTTCAGGTAACCTGTCTGGTCAGAGTGCAGCTTCAG 

TCTTGCACCCCCAGCAGACCCTCCACCCTCCTGGCAACATCCCAGAGTCCGGGCAGAATC 

AGCTGTTACAGCCCCTTAAGCCATCTCCCTCCAGTGACAACCTCTATTCAGCCTTCACCA 

GTGATGGTGCCATTTCAGTACCAAGCCTTTCTGCTCCAGGTCAAGGTAATAAAGCAACCA 

TCATCGTCCAAAAACAATAAAATGGAGATGTTGCCATACCTGGGACAAAAGCCTGTTAAG 

GCGGGTTGGGAGACTAGCTGACCAGAACACAGCCTGTGTGTTGTACACTGAAGAATCTGG 

GTGAAAAGGGAAGTGGAGTGATAATGAGAATCGGTGGGCTCACTGCTCCCATTAGGTGAA 

ATTACTTTTTTTCAAGGAATTACAGTGAAAAGTTACATCTGTGTGGCCTATATGACTTGC 

TCATTTGGGATTTGGAACTTAGGCTTTAATATTAGGCTGAGATTTCCTGGATGAAATTCT 

AAGGTGTTTTAGCAGTTTCTGAAGCTAATACATTTTCTTAGCCATTGTAGAATTTTGTTA 

CTTTTAAGTATGGGAGTGGCATACTAAAATGAATAACCTTACAATTCAGTTTTTTATCCA 

T7VATCTACTTTCCAAATATAGCTCTGTTTATTAGTGATTGCTGAAAAAATTCCCACAGAG 

GAAAGAGCTTTTAGTCATATTAGAACAAGAATTGAAAAGACTTGGGCATCTGGGTGAGAA 

GAATGAAAAAAATATAGGTACTGGCTTATGTGCCTTTGCCACAGTTTCACAGAAATTAGA 

GATCAGTCTCTTCACAGGAAGAATGCACTTGATTGGTAAGGAGGGCAAACTAGCTAGCAT 

TATTCGAACTAAGAAAAGCTTCCGCATTTTGCAGATGGGTAGAATTAAGACCTAATATTT 

CATCTCTTACATATCTGACCTTCCCCCCAGAAGCTTGTTCTTCTGTGTGCCATGTTAGTG 

CATTTCACCACTCCAGCCTCAAGTTTCTAACATCTTGTAGTTGTGTTCTGTCTCTTCTCC 

TCTCTCTGTTCTACCCTGTTTTTCCCCTCTCACAGGCTGTGCGAAGTTTAACTGTGCATC 

TGAACAGGTGACATTCAAACCTGGTGGCAGGAGGACCCGATTTCTGAGTACGCCCTGGTT 

GGCTCTTTGTGTGTAACACCTTTACTCGTTCCTTGTCCTTGTGTTTCTGCTGGTTGGATC 

TGATGTTTCACGCAGTCCATTTTCATTTGTCTCTTTTTGTATATCATCTACTCAGTGGCT 

TGGCTGAATTACTGTTACCCTCAGAAGTTTGGGCCCCCACATTAATTATGATAAAAAATG 

TCAAAATAACAAGTTATCTACAAATTTCAATGTAACTTTCTGGTAGAAGTGCTTCTTCAT 

GGATCTGTGACAGAGAGTGGATATGGTATCTAGGCAATAGATTGCTGGGTCATTTAGAAT 

AATGAAGACTGAACTCCACAGTCGTAGTCAGTGCTGTCTGTCTGCCCTAGCATTAGAAAT 

GAGAGAAATCAGCCAGACACGGTGGCGTACACCTGTAATCCCAGCACTTTGGGAGGCCGA 

GGCGGGAAGATTGGTTGAGGGCAGGAGCTCGAGACCAACCCTGGGCAACATGGTGATACC 

CCATGTCT 

SEQ ID NO: 64_BCON3_H 

GCGGAGCGCAGCTGTGAGGGAGTCGCTGTGATCCGGGGCCCCGGAACCCGAGCTGGAGCT 
GAAGCGCAGGCTGCGGGGCGCGGAGTCGGGAGGCCTGAGTGTTCCTTCCAGCATGTCGGA 
GGGGGAGTCCCAGACAGTACTTAGCAGTGGCTCAGACCCAAAGGTAGAATCCTCATCTTC 
AGCTCCTGGCCTGACATCAGTGTCACCTCCTGTGACCTCCACAACCTCAGCTGCTTCCCC 
AGAGGAAGAAGAAGAAAGTGAAGATGAGTCTGAGATTTTGGAAGAGTCGCCCTGTGGGCG 
CTGGCAGAAGAGGCGAGAAGAGGTGAATGAACGGAATGTACCAGGTATTGACAGTGCATA 
GCTGGCCATGGATACAGAGGAAGGTGTAGAGGTTGTGTGGAATGAGGTACAGTTCTCTGA 
ACGCAAGAACTACAAGCTGCAGGAGGAAAAGGTTCGTGCTGTGTTTGATAATCTGATTCA 
ATTGGAGCATCTTAACATTGTTAAGTTTCACAAATATTGGGCTGACATTAAAGAGAACAA 
GGCCAGGGTCATTTTTATCACAGAATACATGTCATCTGGGAGTCTGAAGCAATTTCTGAA 
GAAGACCAAAAAGAACCACAAGACGATGAATGAAAAGGCATGGAAGCGTTGGTGCACACA 
AATCCTCTCTGCCCTAAGCTACCTGCACTCCTGTGACCCCCCCATCATCCATGGGAACCT 
GACCTGTGACACCATCTTCATCCAGCACAACGGACTCATCAAGATTGGCTCTGTGGCTCC 
TGACACTATCAACAATCATGTGAASACTTGTCGAGAAGAGCAGAAGAATCTACACTTCTT 
TGCACCAGAGTATGGAGAAGTCACTAATGTGACAACAGCAGTGGACATCTACTCCTTTGG
CATGTGTGCACTGGAGATGGCAGTGCTGGAGATTCAGGGCAATGGAGAGTCCTCATATGT 
GCCACAGGAAGCCATCAGCAGTGCCATCCAGCTTCTAGAAGACCGATTACAGAGGGAGTT
CATTCAAAAGTGCCTGCAGTCTGAGCCTGCTCGCAGACCAACAGCCAGAGAACTTCTGTT
CCACCCAGCATTGTTTGAAGTGCCCTCGCTCAAACTCCTTGCGGCCCACTGCATTGTGGG 
ACACCAACACATGATCCCAGAGAACGCTCTAGAGGAGATCACCAAAAACATGGATACTAC
TGCCGTACTGGCTGT^AATCCCTGCAGGACCAGGAAGAGAACCAGTTCAGACTTTGTACT'J 
TCAGTCACCAGCTCTGGAATTAGATAAATTCCTTGAAGATGTCAGGAATGGGATCTATCC 
TCTGACAGCCTTTGGGCTGCCTCGGCCCCAGCAGCCACAGCAGGAGGAGGTGACATCACC 
TGTCGTGCCCCCCTCTGTCAAGACTCCGACACCTGAACCAGCTGAGGTGGAGACTCGGAA 
GGTGGTGCTGATGCAGTGCAACATTGAGTCGGTGGAGGAGGGAGTCAAACACCACCTGAC
ACTTGTGCTGAAGTTGGAGGACAAACTGAACCGGCACCTGAGCTGTGAGGTGATGGCAAA 
TGAGAATATCCCCGAGTTGGCGGCTGAGCTGGTGCAGCTGGGCTTCATTAGTGAGGCTGA 
CCAGAGCCGGTTGACTTCTCTGCTAGAAGAGACCTTGAACAAGTTCAATTTTGCCAGGAA 
CAGTACCCTCAACTCAGCCGCTGTCACCGTCTCCTCTTAGAGCTCACTCGGGCCAGGCCC 
TGATCTGCGCTGTGGCTGTCCCTGGACGTGCTGCAGCCCTCCTGTCCCTTCCCCCCAGTG 
AGTATTACCCTGTGAAGCCCCTTCCCTCCTTTATTATTCAGGAGGGCTGGGGGGGGTCCC 
TGGTTCTGAGCATCATCCTTTCCCCTCCCCTCTCTTCCTCCCCTCTGCACTTTGTTTAGT 
TGTTTTGCACAGACGTGGGCCTGGGCCTTCTCAGCAGCCGGCTTCTAGTTGGGGGGTAGT 
CGCTGATCTGCCGGCTCCCGCGCAGCCTGTGTGGAAAGGAGGCCGACGGGGACTAGGGGA 
GCGGAATTCTACAATCCGGGTGGGGCGGGGGGGGCGGGAGAGAAAGGTGGTGCTGGAGTG 
GTGGGGCTGGGGGGCCATTGGATTCGCGTGAGTTGCTGGTGTAATAAAAGTGTAGTTTTT 

GGT 

SEQ ID NO: 65_AA711829_M 

CTTAAGCAGTTTCTGT^GAAGACGAAAAAGAACCACAAGACTATGAATGAAAAGGCTTGG 
AAACGCTGGTGTACACAGATGGTCTGTGCCCTAAGCTACCTGCACTCCTGTGACCCTCCG 
ATCATCCATGGGAACCTGACCTGTGACACCATCTTCATCCAGCACAACGGACTCATCAAG 
ATTGGCTCTGTGGCTCCTGAGACTATCAACAATCACGTGAAGACTTGCGGGGAAGAACAG 
AAGAACCTACACTTTTTTGCACCAGAGTATGGAGAAGTCACAAACGTGACAACAGCAGTG 
GACATCTACTCCTTTGGCATGTGTGCACTGGAGATGGCAGTGCTGGAGATTCAGGGCAAT 
GGCGAGTCCTCATATGTGCCACAGGAAGCCATCAGCAGTGCCATCCAGCTACTAGAAGAC 
TCATTACAGAGGGAGTTTATTCAAAAGTGCCTGCAGTCTGAGCCTGCTCGGAGACCAACA 
GCGAGAGAACTTCTGTTCCACCCAGCACTGTTTGAAGTGGCCTCACTCAAGCTTCTTGCT 
GCTCACTGTATCGTGGGGCACCAACACATGATCCCAGAGAACGCTCTAGAGGAGATCAGC 
AAGAACATGGATACCAGTGCTGTACTAGCTGAAATTCCCGCAGGGCCAGGACGAGAACCA 
GTTCAGACTTTGTACTCTCAGTCACCAGCCCTAGAATTAGACAAATTCCTTGAAGATGTC 
AGGAATGGGATCTACGCTCTGACAGCCTTTGGGCTACCTCGGCGTCAGCAGCCACAGCAG 
GAGGAGGTGACATCACCTGTTGTGCCCCGGTCTGTCAAGACTCCAACTCCTGAGCCAGCT 
GAAGTGGAGACACGAAAGGTGGTGGTGATGCAGTGGAACATCGAATGTGTGGAGGAGGGA 
GTCAAACACCATCTAACACTTCTGCTGAAGCTGGAGGACAAATTGAACCGGCACCTGAGC 
TGTGACCTGATGCCAAATGAGAGCATCCCGGACTTGGCAGCTGAGCTGGTGCAGCTGGGC 
TTCATTAGTGAGGCTGATCAGAGCGGGCTGACTTCTCTGCTGGAGGAGAGGCTCAAGAAG 
TTCAACTTCACCAGGAACAGTACACTCAACACAGCCACTGTCACCGTCTCCTCGTAGAGC 
TCACTTGAGCCAGGCCCCTAGCCAGGCTGTGGCTGTCCCTGGGCATGCTGCAGTCCTCCT 
GTCCCTTCTCCCCAGTCAGTATTACCCTTCGCGCCCATATTATTTAGGAGGGCTTTAGGG 
GCTCCCTGGTTGAGTATCACCCTGCCCCTTCCCCTCTCTTCCTCCCCTCTGCACTTTGTT 
TACTTGTTTTGCACAGACGTGGGCCTGGGCCTTCTCAGCAGCCACCTTCTAGCTGGGGGC 
TAGTAGCTGACCTGCTGCCTCCTGCCCTACTTGTGTGGACAGGAGGCCCACGGGCACTGG 
GGAAGCTGAGTTCTACAATCCCGCTGGGGCGCATGGGCAGGAGAGAAAGGTGGTGCTGCA 
GGGGTGGCCCCCCGGGGGGGGCATTCGAATCACCTCAGTTGCTGCTGTAATAAAGTCTAC 

TTTTTGGT 
SEQ ID NO: 66_AA099102_H

ATGTCATCATGTGTCTCTAGCCAGCCCAGCAGCAACCGGGCCGCCCCCCAGGATGAGCTG 
GGGGGCAGGGGCAGCAGCAGCAGCGAAAGCCAGAAGCCCTGTGAGGCCCTGCGGGGCCTC 
TCATCCTTGAGCATCCACCTGGGCATGGAGTCCTTCATTGTGGTCACCGAGTGTGAGCCG 
GGCTGTGCTGTGGACCTCGGCTTGGCGCGGGACCGGCCCCTGGAGGCCGATGGCCAAGA'2 
GTCCCCCTTGACACCTCCGGGTCCCAGGCCCGGCCCCACCTCTCCGGTCGCAAGCTGTCT 
CTGCAAGAGCGGTCCCAGGGTGGGCTGGCAGCCGGTGGCAGCCTGGACATGAACGGACGC 
TGCATCTGCCCGTCCCTGCCCTACTCACCCGTCAGCTCCCCGCAGTCCTCGCCTCGGCTG 
GCGCGGCGGCCGAGAGTGGAGTCTCACGACGTGTGCATCACGGGTATGCAGGACTGTGTG 
CAGCTGAATCAGTATACGGTGAAGGATGAAATTGGAAAGGGGTGCTATGGTGTCGTGAAG 
TTGGCCTACAATGAAAATGACAATACCTACTATGCAATGAAGGTGGTGTCCAAAAAGAAG 
CTGATCCGGCAGGCCGCTTTTCCACGTCGCCCTCCACCCCGAGGCACCCGGCCAGCTCCT 
GGAGGCTGCATCCAGCCCAGGGGCCCCATTGAGCAGGTGTACCAGGAAATTGCCATCCTC 
AAGAAGCTGGACCACCCCAATGTGGTGAAGCTGGTGGAGGTCCTGGATGACCGCAATGAG 
GACCATCTGTACATGGTGTTCGAACTGGTCAACCAAGGGCCGGTGATGGAAGTGCGGAGC 
CTCAAACCACTCTCTGAAGACCAGGCCCGTTTCTACTTCCAGGATCTGATCAAAGGCATC 
GAGTACTTACACTACCAGAAGATCATCCACCGTGACATCAAACCTTCCAACCTCCTGGTC 
GGAGAAGATGGGCACATCAAGATCGCTGACTTTGGTGTGAGCAATGAATTCAAGGGCAGT 
GACGCGCTCCTCTCCAACTACGTGGGCACGCCCGCCTTCATGGCTCCCGAGTCGCTCTCT 
GAGACCCGCAAGATCTTCTCTGGGAAGGCCAAGGATGTTTGGGCGATGGGTGTGACACTA 
TACTGGTTTGTCTTTGGGCAGTGCCCATTCATGGACGAGCGGATGATGTGTTTACACAGT 
AAGATCAAGAGTCAGGCCCTGGAATTTCCAGACCAGCCCGACATAGCTGAGGACTTGAAG 
GAGGTGATCACCCGTATGGTGGACAAGAACCCGGAGTCGAGGATCGTGGTGCCGGAAATG 
AAGGTGCACCCCTGGGTCAGGAGGGATGGGGCGGAGCCGTTGCCGTCGGAGGATGAGAAC 
TGGACGCTGGTCGAAGTGACTGAAGAGGAGGTCGAGAACTCAGTCAAACACATTCCCAGC 
TTGGCAACCGTGATCCTGGTGAAGACCATGATACGTAAACGCTCCTTTGGGAACCCATTC 
GAGGGCAGCCGGCGGGAGGAACGCTCACTGTCAGCGCCTGGAAAGTTGCTCACCAAAAAA 
CCAACCAGGGAATGTGAGTCCCTGTCTGAGCTCAAGGAAGCAAGGCAGCGAAGACAACCT 
CCAGGGCACCGACCCGCCCCCCGTGGGGGAGGAGGAAGTGCTCTTGTGAGAGGCAGTCCC 
TGGGTGGAAAGTTGCTGGGCCCCCGCGCCCGGCTCCCCCGCACGCATGCATCCAGTGCGG 
CGGGAGGAGGCCATGGAGCCCGAGTAG 

SEQ ID NO: 67_5R69_17_2_H 

CCGGGATGTGAGCCTGGTGGTTGGCAGCTGGAGCCACGTCGGAGGGGGAAGTGTCGCAGC 
ATTCTCTGCAGGCATCACAGACCTGAGGCAGTGGCGTCCGGAGGGCACTGGACAGAAACA 
GCCATCCAAGTGGGTGAGTGGAGGGACCCTGCTCAAGTGCAGCTGCAGTGGCCGGGGTTT 
CCCTCAGGTAGGGATCGGGGCGCCTTGTCGCCGCCAGCCACGTGTGGCGTCCGGTACAGT 
CAGCAGAGTGCAGGGTGCGGGCACCAGGAAAGGGGGCGCAGGGGAACTCCCGCGGGCCTC 
GCGTTTGCAAACTTCTCGCCTGGGCAGGAGGCGGTCGTGGGAAAGAAGGTGGAAGAGCGA 
GCTTTTTGGAACTGTGCACGGGACAGATTGGACGCACACCCCTCGGGAGGCGCGAAGGCA 
TGGAAAATTTGAAGCATATTATCACCGTTGGCCAGGTCATCCACAAAGGGTGTGAAGAGA 
TGAAATACTGCAAGAAACAGTGCCGGCGCCTGGGCCACCGCGTCCTCGGCCTGATCAAGC 
CTCTGGAGATGCTCCAGGACCAAGGAAAGAGGAGCGTGCCCTCTGAGAAGTTAACCACAG 
CCATGAACCGCTTCAAGGCTGCCCTGGAGGAGGCTAATGGGGAGATAGAAAAGTTCAGCA 
ATAGATCCAATATCTGCAGGTTTCTAACAGCAAGCCAGGACAAAATACTCTTCAAGGACC
TGAACAGGAAGCTGAGTGATGTGTGGAAGGAGCTGTGGCTGTTAGTTCAGGTTGAGCAA<" , 
GCATGCCTGTTTCACCCATAAGCCAAGGAGCGTCCTGGGCACAGGAAGATCAGCAGGATf."; 
GAGACGAAGAGAGGCGAGGTTTGCAGATGCTAAGAAGAGATAATGAAAAAATAGAAl^CT^^ 
CACTGAGACGATTAGAAATCAACATGAAAGAAATCAAGGAAACTTTGAGGCAGTGTAAG':' 
TATCATGTGCCCTGCTGTTTCTGATGGCCCCCAAACTAGAAGTCATGAGTTTACTGG«'JAG 
CCCAGCCTCCCGCTACCCCTGCATTTGTCCATTTTCTGTGCTGGATGGCTGGAAGCAGCC

CACAGGTTTGGGGATCCATTCATGGCTAGCCCAGGCTTCTGTCCATGGAATAACATGTGG 

AGAGAGCTTCTTGACCAGTAAGATACCTTCTAGCAGCTGTCAAAGTACTTAAAAACCTCT 

ATGAATAGAATCAAAGCTTCAGTTCAGTTGCTGAATTTCCAAGAAGAAATTCAAATCAAA 

TTTAAAATGCCCACTCATTCATTCATTCAACAAAACTGTGAGTATCTGGTTTATGCCAGA 

GGCCATGCAAAGAGGTAACTAAGATGCAGAGAAGGACACTGCCTTCCAGGAGCTCACGGG 

GTGGAGGAGGAAAGAGGAAAGACAGACAGTGAACACACAACAGCAAGGTTACTGAGCTTG 

AACTATGTCCCTAACTACTAGATCTGAAATGACTACGCCAGATGCCAGATGCTCAAGTGC 

CAAGCTCTGGGTAACAGGAATAGACATCCTTCCAGGATGAGAGAGATGAGTCTGGATGAG 

GGTTAAGGCTGGAGGGACAGGCGGGATTTGAAGAGGAGGGAAAGGAAGTGGATGACACAT 

TCTGTTAAGTGTCCAGCTGTGTCTCTACTGGTCACTCAGAGGCACGGGAGCCGCTCCCTT 

GGGCTGAGTCCATCAGAAGCCCCAGCCACCACCAGCTCTGGTTGATGTAGTAGAGCTTCG 

CACTCACACATCACAAATATGCCACCTCCCTTAGGACCCGTTCGTCTGCTCATTGACTCT 

TTTGTCTTGTTTCCTCTGGGGGGTGAGGTCAGATTTACCACGAAAATGCATGCAGGAGAT 

CCCGCAAGAGCAAATCAAGGAGATCAAGAAGGAGCAGCTTTCAGGATCCCCGTGGATTCT 

GCTAAGGGAAAATGAAGTGAGCACACTTTATAAAGGAGAATACCACAGAGCTCCAGTGGC 

CATAAAAGTATTCAAAAAACTCCAGGCTGGCAGCATTGCAATAGTGAGGCAGACTTTCAA 

TAAGGAGATCAAAACCATGAAGAAATTCGAATCTCCCAACATCCTGGGTATATTTGGGAT 

TTGCATTGATGAAACAGTGACTCCGCCTCAATTCTCCATTGTCATGGAGTACTGTGAAGT 

CGGGACCCTGAGGGAGCTGTTGGATAGGGAAAAAGACCTCACACTTGGCAAGCGCATGGT 

CCTAGTCCTGGGGGCAGGCCGAGGCCTATACCGGCTACACCATTCAGAAGCACCTGAAGT 

CCACGGAAAAATCAGAAGCTCAAACTTCCTGGTAACTCAAGGCTACCAAGTGAAGCTTGC 

AGGATTTGAGTTGAGGAAAACACAGAGTTCCATGAGTTTGGGAACTACGAGAGAAAAGAG 

AGACAGAGTCAAATGTACAGCATATCTCTCACCTCAGGAACTGGAAGATGTATTTTATCA 

ATATGATGTAAAGTCTGAAATATACAGCTTTGGAATCGTCCTCTGGGAAATCGCCACTGG 

AGATATCCCGTTTCAAGGTGAAGAATGTGAAGACTGGCTCAGCCAGTGGCTGTAATTCTG 

AGAAGATCCGCAAGCTGGTGGCTGTGAAGCGGCAGCAGGAGCCACTGGGTGAAGACTGCC 

CTTCAGAGCTGCGGGAGATCATTGATGAGTGCCGGGCCCATGATCCCTCTGTGCGGGGCT 

CTGTGGATGAAATCTTAAAGAAACTCTCCACCTTTTCTAAGTAGTGTATCAAAATCTAAA 

CCAAGGAGTCTCTGGACAAGAAGCTGGGAGAGGCACGAACTGGACATCTCTCTGTCTCAT 

ATGGTTCGGCATTGGGTTATCTATGGGTGCAAGGAGTGGGCACGCTTCTCTGTTACAAAT 

AGAAAACGATTCCAGTCATACAGGAGACATCCCAGTCCAAATGATATTTGCAAAAACATA 

CCTCTGACAGTAACTTTGATAGATGGTTTGTCAAATGTATCTTTCTGGGTATCCACACCT 

CTTGGCAATGAAATTTGCAGCTCGTCCCTTGCATAAATGAAGTGTCTTTCCCCACCATTT 

GAATCTGGGCTGGCACTGTGACTTGATTTGATCAATAGAATGTGGAAGAAGTGACTGTAT 

GCCAGTTCCAAGCCTAGGTTTCAAGAGGCCTTATAAATGTCTGTTGGAACCTTACCCAGC 

CATGGACATGTTGAGTGAGGATGGTGGAGAATGAGAGAGCAGATGAAGCAGAAAGATGGT 

TTCCTAGCTGAAGTCATACTAGCCCAACCAACATGGCAGCTAACACATGAATGAGGCCAA 

TCAAGACCAGAAGAACCACTCAAGCAGATGCCAGCGCAAATTGGCGATTCAGAGAATCAG 

GAGCTAAATAAATTACTGTTGTCTTTT 

SEQ ID NO: 68_H85811_H 

CGCCCGGCCCCCTCCCCCGGCGCCGGCCACGGGAGGCGGTGATGCGGGCGCGGGGGGCCT 
CGGGTGCGCCGAGAGCGGAGACACAGGGTCAAGATGGGAGATTCCGAGTGAGGGTGGGGG 
GGGCGAGCTGGCGCGCCGCTTTCGCGTGGCCGTTGCCATGAACGGCGGACACCGCGGCCC 
CGATGGCCCCCGTGTACGAAGGTATGGCCTCACATGTGCAAGTTTTCTCCCCTCACACCC 
TTCAATCAAGTGGGTTCTGTAGTGTGAAGAAACTGAAAATAGAGCCGAGTTCGAAGTGGG 
ACATGACTGGGTACGGCTCCCACAGCAAAGTGTATAGCCAGAGCAAGAACATCCCCCTGT 
CGCAGCCAGCCACCACAACCGTCAGCACCTCCTTGCCGGTCCCAAACCCAAGCCTACCTT 
ACGAGCAGACCATCGTCTTCCCAGGAAGCACCGGGCACATCGTGGTCACCTCAGCAAGCA 
GCACTTCTGTCACCGGGCAAGTCCTCGGCGGACCACACAACCTAATGCGTCGAAGCACTG
TGAGCCTCCTTGATACCTACCAAAAATGTGGACTCAAGCGTAAGAGCGAGGAGATCGAGA
ACACAAGCAGCGTGCAGATCATCGAGGAGCATGCACCCATGATTCAGAATAATGCA-AGC'J 
GGGCCACTGTCGCCACTGCCACCACGTCTACTGCCACCTCCAAAAACAGCGGCTtZCAACA 
GCGAGGGCGACTATCAGCTGGTGCAGCATGAGGTACTGTGCTCCATGACCAACACCTACG 
AGGTCTTAGAGTTCTTGGGCCGAGGGACGTTTGGGCAAGTGGTCAAGTGCTGGAAACGGG 
GCACCAATGAGATCGTAGCCATCAAGATCCTGAAGAACCACCCATCCTATGCCCGACAAG 
GTCAGATTGAAGTGAGCATCCTGGCCCGGTTGAGCACGGAGAGTGCCGATGACTATAACT 
TCGTCCGGGGCTACGAATGCTTCCAGCACAAGAACCACACGTGGTTGGTCTTGGAGATGT 
TGGAGCAGAACCTCTATGACTTTCTGAAGCAAAACAAGTTTAGCCCCTTGCCCCTCAAAT 
ACATTCGCCCAGTTCTCCAGCAGGTAGCCACAGCCCTGATGAAACTCAAAAGCCTAGGTC 
TTATCCACGCTGACCTCAAACCAGAGAACATCATGCTGGTGGATCCATCTAGACAACCAT 
ACAGAGTCAAGGTCATCGACTTTGGTTCAGCCAGCCACGTCTCCAAGGCTGTGTGCTCCA 
CCTACTTGCAGTCCAGATATTACAGGGCCCCTGAGATCATCCTTGGTTTACCATTTTGTG 
AGGCAATTGACATGTGGTCCCTGGGCTGTGTTATTGCAGAATTGTTCCTGGGTTGGCCGT 
TATATCCAGGAGATTCGGAGTATGATCAGATTCGGTATATTTCACAAACACAGGGTTTGC 
CTGCTGAATATTTATTAAGCGCCGGGACAAAGACAACTAGGTTTTTCAACCGTGACACGG 
ACTCACCATATCCTTTGTGGAGACTGAAGAGACCAGATGAGCATGAAGGAGAGACAGGGA 
TTAAGTCAAAAGAAGCAAGAAAGTACATTTTGAACTGTTTAGATGATATGGCGCAGGTGA 
ACATGACGACAGATTTGGAAGGGAGCGACATGTTGGTAGAAAAGGCTGACCGGCGGGAGT 
TCATTGACCTGTTGAAGAAGATGCTGACCATTGATGCTGACAAGAGAATCACTCCAATCG 
AAACCCTGAACCATCCCTTTGTCACCATGACACACTTACTCGATTTTCCCCACAGCACAC 
ACGTCAAATCATGTTTCCAGAACATGGAGATCTGGAAGCGTCGGGTGAATATGTATGACA 
CGGTGAACCAGAGCAAAACCCCTTTCATCACGCACGTGGCCCCCAGCACGTCCACCAACC 
TGACCATGACCTTTAACAACCAGCTGACCACTGTCCACAACCAGCCCTCAGCGGCATCCA 
TGGCTGCAGTGGCCCAGCGGAGCATGCCCCTGCAGACAGGAACAGCCCAGATTTGTGCCC 
GGCCTGACCCGTTCCAGCAAGCTCTCATCGTGTGTCCCCCCGGCTTCCAAGGCTTGCAGG 
CCTCTCCCTCTAAGCACGCTGGCTACTCGGTGCGAATGGAAAATGCAGTTCCCATCGTCA 
CTCAAGCCCCAGGAGCTCAGCCTCTTCAGATCCAACCAGGTCTGCTTGCCCAGCAGGCTT 
GGCCAAGTGGGACCCAGCAGATCCTGCTTCCCCCAGCATGGCAGCAACTGACTGGAGTGG 
CCACCCACACCTCAGTGCAGCATGCCACCGTGATTCCCGAGACCATGGCAGGCACCCAGC 
AGCTGGCGGACTGGAGAAATACGCATGCTGACGGAAGCCATTATAATCCCATCATGGAGG 
AGCCTGCACTATTGACCGGTCATGTGACCCTTCCAGCAGCACAGCCCTTAAATGTGGGTG 
TGGCCCACGTGATGCGGCAGCAGCCAACCAGCACCACCTCCTCCCGGAAGAGTAAGCAGC 
ACCAGTCATCTGTGAGAAATGTCTCCACCTGTGAGGTGTCCTCCTCTCAGGCCATCAGCT 
CCCCACAGCGATCCAAGCGTGTCAAGGAGAACACACCTCCCCGCTGTGCCATGGTGCACA 
GTAGCCCGGCCTGCAGCACCTCGGTCACCTGTGGGTGGGGCGACGTGGCCTCCAGCACCA 
CCCGGGAACGGCAGCGGCAGACAATTGTCATTCCCGACACTCCCAGCCCCACGGTCAGCG 
TCATCACCATCAGCAGTGACACGGACGAGGAGGAGGAACAGAAACACGCCCCCACCAGCA 
CTGTCTCCAAGCAAAGAAAAAACGTCATCAGCTGTGTCACAGTCCACGACTCCCCCTACT 
CCGACTCCTCCAGCAACACCAGCCCCTACTCCGTGCAGCAGCGTGCTGGGCACAACAATG 
CCAATGCCTTTGACACCAAGGGGAGCCTGGAGAATCACTGCACGGGGAACCCCCGAACCA 
TCATCGTGCCACCCCTGAAAACCCAGGCCAGCGAAGTATTGGTGGAGTGTGATAGCCTGG 
TGCCAGTCAACACCAGTCACCACTCGTCCTCCTACAAGTCCAAGTCCTCCAGCAACGTGA 
CCTCCACCAGCGGTCACTCTTCAGGGAGCTCATCTGGAGCCATCACCTACCGGCAGCAGC 
GGCCGGGCCCCCACTTCCAGCAGCAGCAGCCACTCAATCTCAGCCAGGCTCAGCAGCACA 
TCACCACGGACCGCACTGGGAGCCACCGAAGGCAGCAGGCCTACATCACTCCCACCATGG 
CCCAGGCTCCGTACTCCTTCCCGCACAACAGGCGCAGCCACGGCACTGTGCACCCGCA/r I" 
TGGCTGCAGCCGCTGCCGGTGGGGAGCTCGGGACGGAGGCCGACCTCTACACCTACAGTG 
CGCCGGCGGCCCTGGGCTCCACCGGCACCGTGGCCCACCTGGTGGCCTCGCAAGGCTCTG 
CGCGCCACACCGTGCAGCACACTGCCTACCCAGCCAGCATCGTCCACCAGGTCCCCGTGA
GCATGGGCCCCCGGGTCCTGCCCTCGCCCACCATCCACCCGAGTCAGTATCCAGCCCAAT 
TTGCCCACCAGACCTACATCAGCGCCTCGCCAGCCTCCACCGTCTACACTGGATACCCAC 
TGAGCCCCGCCAAGGTCAACCAGTACCCTTACATATAAACACTGGAGGGGAGGGAGGGAG 
GGAGGGAGGGAGAGAATGGCCCGAGGGAGGAGGGAGAGAAGGAGGGAGGCGCTCCTGGGA 
CCGTGGGCGGTGGCCTTTTATACTGAAGATGCCGCACACAAACAATGCAAACGGGGCAGG 
GGCGGGGGGGGGGGGGCAGAGGGCAGGGGGACGGGTCGGGACACCAGTGAAACTTGAACC 
GGGAAGTGGGAGGACGTAGAGCAGAGAAGAGAACATTTTTAAAAGGAAGGGATTAAAGAG 
GGTGGGAAATCTATGGTTTTTATTTTAAAAAAG 

SEQ ID NO: 69_DYRK3_H

CGGGAGCGAAAGTGCGCTGAGCTGCAGTGTCTGGTCGAGAGTACCCGTGGGAGCGTCGCG 

CGGCGGAGGCAGCCGTCCCGGCGTAGGTGGCGTGGCCGACCGGACCCCCAACTGGCGCCT 

CTCCCCGAGCGGGGTCCCGAGCTAGGAGATGGGAGGCACAGCTCGTGGGCCTGGGCGGAA 

GGATGCGGGGCCGCCTGGGGCCGGGCTCCCGCCCCAGCAGCGGAGTTGGGGGATGGTGTC 

TATGACACCTTCATGATGATAGATGAAACCAAATGTCCCCCCTGTTCAAATGTACTCTGC 

AATCCTTCTGAACCACCTCCACCCAGAAGACTAAATATGACCGCTGAGCAGTTTACAGGA 

GATCATACTCAGCACTTTTTGGATGGAGGTGAGATGAAGGTAGAACAGCTGTTTCAAGAA 

TTTGGCAACAGAAAATCCAATACTATTCAGTCAGATGGCATCAGTGACTCTGAAAAATGC 

TCTCCTACTGTTTCTCAGGGTAAAAGTTCAGATTGCTTGAATACAGTAAAATCCAACAGT 

TCATCCAAGGCACCCAAAGTGGTGCCTCTGACTCGAGAACAAGCCCTG 

CACCACCTCACTGCCTATGAGAAACTGGAAATAATTAATTATCCAGAAATTTACTTTGTA 

GGTCCAAATGCCAAGAAAAGACATGGAGTTATTGGTGGTGCCAATAATGGAGGGTATGAT 

GATGCAGATGGGGCCTATATTCATGTACCTCGAGACCATCTAGCTTATCGATATGAGGTG 

CTGAAAATTATTGGCAAGGGGAGTTTTGGGCAGGTGGCCAGGGTCTATGATCACAAACTT 

GGACAGTACGTGGCCCTAAAAATGGTGCGCAATGAGAAGCGCTTTCATCGTCAAGCAGCT 

GAGGAGATCCGGATTTTGGAGCATCTTAAGAAACAGGATAAAACTGGTAGTATGAACGTT 

ATCCACATGCTGGAAAGTTTCACATTCCGGAACCATGTTTGCATGGCCTTTGAATTGCTG 

AGCATAGACCTTTATGAGCTGATTAAAAAAAATAAGTTTCAGGGTTTTAGCGTCCAGTTG 

GTACGCAAGTTTGCCCAGTCCATCTTGCAATCTTTGGATGCCCTCCACAAAAATAAGATT 

ATTCACTGCGATCTGAAGCCAGAAAACATTCTCCTGAAACACCACGGGCGCAGTTCAACC 

AAGGTCATTGACTTTGGGTCCAGCTGTTTCGAGTACCAGAAGCTCTACACATATATCCAG 

TGTCGGTTCTACAGAGCTCCAGAAATCATCTTAGGAAGCCGCTACAGCACACCAATTGAC 

ATATGGAGTTTTCGCTGCATCCTTGCAGAACTTTTAACAGGACAGCCTCTCTTCCCTGGA 

GAGGATGAAGGAGACCAGTTGGCCTGCATGATGGAGCTTCTAGGGATGCCACCACCAAAA 

CTTCTGGAGCAATCCAAACGTGCCAAGTACTTTATTAATTCCAAGGGCATACCCCGCTAC 

TGCTCTGTGACTACCCAGGCAGATGGGAGGGTTGTGCTTGTGGGGGGTCGCTCACGTAGG 

GGTAAAAAGCGGGGTCCCCCAGGCAGCAAAGACTGGGGGACAGCACTGAAAGGGTGTGAT 

GACTACTTGTTTATAGAGTTCTTGAAAAGGTGTCTTCACTGGGACCCCTCTGCCCGCTTG 

ACCCCAGCTCAAGCATTAAGACACCCTTGGATTAGCAAGTCTGTCCCCAGACCTCTCACC 

ACCATAGACAAGGTGTCAGGGAAACGGGTAGTTAATCCTGCAAGTGCTTTCCAGGGATTG 

GGTTCTAAGCTGCCTCCAGTTGTTGGAATAGCCAATAAGCTTAAAGCTAACTTAATGTCA 

GAAACCAATGGTAGTATACCCCTATGCAGTGTATTGCCAAAACTGATTAGCTAGTGGACA 

GAGATATGCCCAGAGATGCATATGTGTATATTTTTATGATCTTACAAACCTGCAAATGGA 

AAAAATGCAAGCCCATTGGTGGATGTTTTTGTTAGAGTAGACTTTTTTTAAACAAGACAA 

AACATTTTTATATGATTATAAAAGAATTCTTCAAGGGCTAATTACCTAACCAGCTTGTAT 

TGGCCATCTGGAATATGCATTAAATGACTTTTTATAGGTCA 
SEQ ID NO: 70_AA589241_M, DYRK3_M

CCACGCGTCCGGAGTTGCTAGGAATGCCACCGCAGAAACTTCTGGAGCAATCCAAGCGTG 
CTAAGTACTTTATTAAGTCCAAAGGCTTGCGTCGATACTGCTCCGTATCTACCCAGACGG
ACGGGAGGGTGGTGCTTCTCGGGGGTCGCTCACGCAGGGGTAAAAAGCGAGGCCCGCCAG 
GCAGCAAAGACTGGGCAACCGCACTGAAGGGCTGTGGTGACTACTTGTTCATAGAGTTTC 
TGAAACGATGCCTCCAGTGGGACCCCTCTGCCCGCCTCACCCCGGCTCAAGCATTAAGAC 
ATGCTTGGATTAGGAAGTCTACACCCAAACCTCTCACCATGGACAAGGTGCCAGGGAAGG 
GGGTAGTTAACCCTACAAATGCTTTCCAGGGACTGGGTTCCAAGCTGCCTCCAGTCGTTG 
GGATAGCCAGTAAGCTTAAAGGTAACCTAATGTCCGAAACCAGTGGTAGTATACCTCTGT 
GCAGTGTATTGCCAAAGCTGATTAGCTAGTGGACCACTCAGAGACTGATACATATCATAT 
GTATTTTTAATTACCTTGCAAACATGCAAATGGAAAACGGAATAATTGAAGCCCATTCAC 
TGATGGATATGTTTTTGTTAGACTTTTTTTTAACAAGGCAGAACATTTTTATATGACTAT 
AAAAGAACGCTTCAAGGGGTAATGTCAAACCAGCTTGTATTGGCCATCTGGAGTATACAT 
TAAATGACTTTTTCATAGGTC 

SEQ ID NO: 71_5R72_16_2_H

GTCGAGGCGCAGCGCTGCCATGGCTGGGGGCCGTGGGGCCCCCGGGCGCGGCCGGGACGA 
GCCTCCGGAGAGCTACCCGCAACGACAGGACCACGAGCTACAGGCCCTGGAGGCCATCTA 
CGGCGCGGACTTCCAAGACCTGCGGCCGGACGCTTGCGGACCGGTCAAAGAGCCCCCTGA 
AATCAATTTAGTTTTGTACCCTCAAGGCCTAACTGGTGAAGAAGTATATGTAAAAGTGGA 
TTTGAGGGTTAAATGCCCACCTACCTATCCAGATGTAGTTCCTGAAATAGAGTTAAAAAA 
TGCCAAAGGTCTATCAAATGAAAGTGTCAATTTGTTAAAATCTCGCCTAGAAGAACTGGC 
CAAGAAACACTGTGGGGAGGTGATGATCTTTGAACTGGCTTACCACGTGCAGTCATTTCT 
CAGCGAGCATAACAAGCCCCCTCCCAAGTCTTTTCATGAAGAAATGCTGGAAAGGCGGGC 
TCAGGAGGAGCAGCAGAGGCTGTTGGAGGCCAAGCGGAAAGAAGAGCAGGAGCAACGTGA 
AATCCTGCATGAGATTCAGAGAAGGAAAGAAGAGATAAAAGAAGAGAAAAAAAGGAAAGA 
AATGGCTAAGCAGGAACGTTTGGAAATTGCTAGTTTGTCAAACCAAGATCATACCTCTAA 
GAAGGACCCAGGAGGACACAGAACGGCTGCCATTCTACATGGAGGCTCTCCTGACTTTGT 
AGGAAATGGTAAACATCGGGCAAACTCCTCAGGAAGGTCTAGGCGAGAACGTCAGTATTC 
TGTATGTAATAGTGAAGATTCTCCTGGCTCTTGTGAAATTCTGTATTTCAATATGGGGAG 
TCCTGATCAGCTCATGGTGCACAAAGGGAAATGTATTGGCAGTGATGAACAACTTGGAAA 
ATTAGTCTACAATGCTTTGGAAACAGCCACTGGTGGCTTTGTCTTGTTGTATGAGTGGGT 
CCTTCAGTGGCAGAAAAAAATGGGTCCATTCCTTACCAGTCAAGAAAAAGAGAAGATTGA 
TAAGTGCAAAAAGCAGATTCAAGGAACAGAAACAGAATTCAACTCACTGGTAAAATTGAG
CCATCCAAATGTAGTACGCTACCTTGCAATGAATCTCAAAGAGCAAGACGACTCCATCGT 
GGTGGACATTTTAGTGGAGGACATTAGTGGGGTCTCTCTTGCTGCACACCTGAGCCACTG 
AGGCCCCATCCCTGTGCATCAGCTTCGCAGGTACACAGCTCAGCTCCTGTCAGGCCTTGA 
TTATCTGCACAGCAATTCTGTGGTGCATAAGGTCCTGAGTGCATCTAATGTCTTGGTGGA 
TGCAGAAGGCACCGTCAAGATTACGGACTATAGCATTTCTAAGCGCCTCGCAGACATTTG 
CAAGGAGGATGTGTTTGAGCAAACCCGAGTTCGTTTTAGTGACAATGCTCTGCCTTATAA 
AACGGGGAAGAAAGGAGATGTTTGGCGTCTTGGCCTTCTGCTGCTGTCCCTCAGCCAAGG 
ACAGGAATGTGGAGAGTACCCTGTGACCATCCCTAGTGACTTACCAGCTGACTTTCAAGA 
TTTTCTAAAGAAATGTGTGTGCTTGGATGACAAGGAAAGATGGAGTCCCCAGCAGTTGTT 
GAAACACAGCTTTATAAATCCCCAGCCAAAAATGCCTCTAGTGGAACAAAGTCCTGAAGA 
TTCTGGAGGACAAGATTATGTTGAGACTGTTATTGCTAGCAAGGGGGTACCCAGTGCTG'l? 
CTTCTTTAGTGAGACACAGAGACAGTTTTCCCGATACTTCATTGAGTTTGAAGAATTACA
ACTTCTTGGTAAAGGAGCTTTTGGAGCTGTCATCAAGGTGCAGAACAAGTTGGACGGCTG 
CTGCTACGCAGTG7^GGGCATCCCCATCAACCCGGCCAGCCGi^CAGTTCCGCAGGATCA-A 
GGGCGAAGTGAGACTGCTGTCACGGCTGCACCATGAGAACATTGTGCGCTACTACAACG'.: 
CTGGATCGAGCGGCACGAGGGGCCGGCGGGACCGGGGACGCGGCCCCCGGACTCCGGGCC 







FIGURE 2GGG 

CCTGGCCAAGGATGACCGAGCTGCACGCGGGCAGCCGGCGAGCGACACAGACGGCCTGGA 
CAGCGTAGAGGCCGCCGCGCCGCCACCCATCCTCAGCAGCTCGGTGGAGTGGAGCACTTC 
GGGCGAGCGCTCGGCCAGTGCCCGTTTCCCCGCCACTGGCCCGGGCTCCAGCGATGACGA
GGACGACGACGAGGACGAGCACGGTGGCGTCTTCTCCCAGTCCTTCCTGCCTGCTTCAGA 
TTCTGAAAGTGATATTATCTTTGACAATGAAGATGAGAACAGTAAAAGTCAGAATCAGGA 
TGAAGATTGGAATGAAAAGAATGGGTGCCATGAAAGTGAGCCATGAGTGACGACTGAGGC 
TGTGCACTACCTATACATCCAGATGGAGTACTGTGAGAAGAGCACTTTACGAGACACCAT 
TGACGAGGGAGTGTATGGAGACACGGTCAGAGTCTGGAGGCTTTTTCGAGAGATTCTGGA 
TGGATTAGCTTATATCCATGAGAAAGGAATGATTCACCGGGATTTGAAGCCTGTCAACAT 
TTTTTTGGATTCTGATGACCATGTGAAAATAGGTGATTTTGGTTTGGCGACAGAGCATCT 
AGCCTTTTCTGCTGACAGCAAACAAGACGATCAGACAGGAGACTTGATTAAGTCAGACCC 
TTCAGGTCACTTAACTGGGATGGTTGGCACTGCTCTCTATGTAAGCCCAGAGGTCCAAGG 
AAGCACCAAATCTGCATACAACCAGAAAGTGGATCTCTTCAGCCTGGGAATTATCTTCTT 
TGAGATGTCCTATCACCCCATGGTCACGGCTTCAGAAAGGATCTTTGTTCTCAACCAACT 
CAGAGATCCCACTTCGCCTAAGTTTCCAGAAGACTTTGACGATGGAGAGCATGCAAAGCA 
GAAATCAGTCATCTCCTGGCTGTTGAACCACGATCCAGCAAAACGGCCCACAGCCACAGA 
GCTGCTCAAGAGTGAGCTGCTGCCCCCACCCCAGATGGAGGAGTCAGAGCTGCATGAAGT 
GGTGCACCACACGCTGACCAACGTGGATGGGAAGGCGTACCGCACCATGATGGCCCAGAT 
CTTCTCGCAGCGCATCTCCCCTGCCATCGATTACACCTATGACAGCGACATACTGAAGGG 
CAACTTCTCAATCCGTACAGGCAAGATGCAGCAGCATGTGTGTGAAACCATCATCCGCAT 
CTTTAAAAGACATGGAGCTGTTCAGTTGTGTACTCCACTACTGCTTGCGCGAAACAGACA 
AATATATGAGCACAACGAAGGTGCCCTATTGATGGACCACAGCGGGATGCTGGTGATGCT 
TCCTTTTGACCTGCGGATCCCTTTTGCAAGATATGTGGCAAGAAATAATATATTGAATTT 
AAAACGATACTGCATAGAACGTGTGTTCAGGGGGCGCAAGTTAGATGGATTTCATCCCAA 
AGAACTTCTGGAGTGTGCATTTGATATTGTCACTTCTACCACCAACAGCTTTCTGCCCAC 
TGCTGAAATTATCTACACTATCTATGAAATCATCCAAGAGTTTCCAGCACTTCAGGAAAG 
AAATTACAGTATTTATTTGAACCATACCATGTTATTGAAAGCAATACTCTTACACTGTGG 
GATCCCAGAAGATAAACTCAGTCAAGTCTACATTATTCTGTATGATGCTGTGACAGAGAA 
GGTGACGAGGAGAGAAGTGGAAGCTAAATTTTGTAATCTGTCTTTGTCTTCTAATAGTCT 
GTGTCGACTCTACAAGTTTATTGAACAGAAGGGAGATTTGCAAGATCTTATGCCAACAAT 
AAATTCATTAATAAAACAGAAAACAGGTATTGCACAGTTGGTGAAGTATGGCTTAAAAGA 
CCTAGAGGAGGTTGTTGGACTGTTGAAGAAACTCGGCATCAAGTTACAGGTCTTGATCAA 
TTTGGGCTTGGTTTACAAGGTGCAGCAGCACAATGGAATCATCTTCCAGTTTGTGGCTTT 
CATCAAACGAAGGCAAAGGGCTGTACCTGAAATCCTCGCAGCTGGAGGCAGATATGACCT 
GCTGATTCCCCAGTTTAGAGGGCCACAAGCTCTGGGGCCAGTTCGCACTGCCATTGGGGT 
CAGCATAGGTATAGACAAGATATCTGCTGCTGTCCTCAACATGGAGGAATCTGTTACAAT 
AAGCTCTTGTGACCTGGTGGTTGTAAGTGTTGGTCAGATGTCTATGTCCAGGGCCATCAA 
CCTAACCCAGAAACTCTGGACAGCAGGCATCACAGCAGAAATCATGTACGACTGGTCACA 
GTGCCAAGAGGAATTACAAGAGTACTGGAGAGATCATGAAATCACGTATGTGGCCCTTGT 
CTGGGATAAAGAAGGAAGCCATGTCAAGGTTAAGTCTTTCGAGAAGGAAAGGCAGACAGA 
GAAGCGTGTGCTGGAGACTGAAGTTGTGGACCATGTACTGCAGAAACTGAGGACTAAAGT 
CACTGATGAAAGGAATGGCAGAGAAGCTTCCGATAATCTTGCAGTGCAAAATCTGAAGGG 
GTCATTTTCTAATGCTTCAGGTTTGTTTGAAATCCATGGAGCAACAGTGGTTCCCATTGT 
GAGTGTGCTAGCCCCGGAGAAGCTGTCAGCCAGCACTAGGAGGCGCTATGAAACTCAGGT 
ACAAACTCGACTTCAGACCTCCCTTGCCAACTTACATCAGAAAAGCAGTGAAATTGAAAT 
TCTGGCTGTGGATCTACCCAAAGAAACAATATTACAGTTTTTATCATTAGAGTGGGATGC 
TGATGAACAGGCATTTAACACAACTGTGAAGCAGCTGCTGTCACGCCTGCCAAAGCAAAG 
ATACCTCAAATTAGTCTGTGATGAAATTTATAACATCAAAGTAGAAAAAAAGGTGTCTGT 
GGTATTTGTGTACAGCTATAGAGATGACTACTACAGAATCTTATTTTAAGCCTAAAGAAC 
TGTCGTTAACCTCATTCAAACAGACAGAGGCTTATACTGGAATAATGGAATGTTGTACAT 
TCATCATAATTTAAAATTAAATTCTAAGAAGAGGCTGGGTGCAGTGGCTCACACCTTTAA
TCCCAGCACTTTGGGAAGCCAAGGCAGGAAGACTGCTTGAAACCAGGAGTTTGAGACCAG 
CCT 

SEQ ID NO: 73_R43524_H, HRI_H 

ATGCTGGGGGGCAACTCCGGGGTCCGCAAGCGCGAAGAGGAGGGCGACGGGGCTGGGGCT 

GTGGCTGCGCCGCCGGCCATCGACTTTCCCGCCGAGGGCCCGGACCCCGAATATGACGAA 

TCTGATGTTCCAGCAGAAATCCAGGTGTTAAAAGAACCCCTACAACAGCCAACCTTCCCT 

TTTGCAGTTGCAAACCAACTCTTGCTGGTTTCTTTGCTGGAGCACTTGAGCCACGTGCAT 

GAACCAAACCCACTTCGTTCAAGACAGGTGTTTAAGCTACTTTGCCAGACGTTTATCAAA 

ATGGGGCTGTTGTCTTCTTTCACTTGTAGTGACGAGTTTAGCTCATTGAGACTACATCAC 

AACAGAGCTATTACACACTTAATGAGGTCTGCTAAAGAGAGAGTTCGTCAGGATCCTTGT 

GAGGATATTTCTCGTATCCAGAAAATCAGATCAAGGGAAGTAGCCTTGGAAGCACAAACT 

TCACGTTACTTAAATGAATTTGAAGAACTTGTCATCTTAGGAAAAGGTGGATACGGAAGA 

GTATACAAGGTCAGGAATAAATTAGATGGTCAGTATTATGCAATAAAAAAAATCCTGATT

AAGGGTGCAACTAAAACAGTTTGCATGAAGGTCCTACGGGAAGTGAAGGTGCTGGCAGGT 

CTTCAGCACCCCAATATTGTTGGCTATCACACCGCGTGGATAGAACATGTTCATGTGATT 

CAGCCACGAGCAGACAGAGCTGCCATTGAGTTGCCATCTCTGGAAGTGCTCTCCGACCAG 

GAAGAGGACAGAGAGCAATGTGGTGTTAAAAATGATGAAAGTAGCAGCTCATCCATTATC 

TTTGCTGAGCCCACCCCAGAAAAAGAAAAACGCTTTGGAGAATCTGACACTGAAAATCAG 

AATAACAAGTCGGTGAAGTACACCACCAATTTAGTCATAAGAGAATCTGGTGAACTTGAG

TCGACCCTGGAGCTCCAGGAAAATGGCTTGGCTGGTTTGTCTGCCAGTTCAATTGTGGAA 

CAGCAGCTGCCACTCAGGCGTAATTCCCACCTAGAGGAGAGTTTCACATCCACCGAAGAA 

TCTTCCGAAGAAAATGTCAACTTTTTGGGTCAGACAGAGGCACAGTACCACCTGATGCTG 

CACATCCAGATGCAGCTGTGTGAGCTCTCGCTGTGGGATTGGATAGTCGAGAGAAACAAG 

CGGGGCCGGGAGTATGTGGACGAGTCTGCCTGTCCTTATGTTATGGCCAATGTTGCAACA 

AAAATTTTTCAAGAATTGGTAGAAGGTGTGTTTTACATACATAACATGGGAATTGTGCAC 

CGAGATCTGAAGCCAAGAAATATTTTTCTTCATGGCCCTGATCAGCAAGTAAAAATAGGA 

GACTTTGGTCTGGCCTGCACAGACATCCTACAGAAGAACACAGACTGGACCAACAGAAAC 

GGGAAGAGAACACCAACACATACGTCCAGAGTGGGTACTTGTCTGTACGCTTCACCCGAA 

CAGTTGGAAGGATCTGAGTATGATGCCAAGTCAGATATGTACAGCTTGGGTGTGGTCCTG 

CTAGAGCTCTTTCAGCCGTTTGGAACAGAAATGGAGCGAGCAGAAGTTCTAACAGGTTTA 

AGAACTGGTCAGTTGCCGGAATCCCTCCGTAAAAGGTGTCCAGTGCAAGCCAAGTATATC 

CAGCACTTAACGAGAAGGAACTCATCGCAGAGACCATCTGCCATTCAGCTGCTGCAGAGT 

GAACTTTTCCAAAATTCTGGAAATGTTAACCTCACCCTACAGATGAAGATAATAGAGCAA 

GAAAAAGAAATTGCAGAACTAAAGAAGCAGCTAAACCTCCTTTCTCAAGACAAAGGGGTG 

AGGGATGACGGAAAGGATGGGGGCGTGGGATGA 

SEQ ID NO: 74_17000057519457_H

CACAAGAGCCCTTCCTGCAGGGAACCTCAGGCTTCAGAGAGCCGAAAAGTTGGGAGGCGT 
AACCACTTACAGGCCGGAAGTGTCCGGGGTGGACGCATTCGGGTAGCCGAAGAAGTCCCA 
GGATTGCCGAAGAAGTCCCAGGATTTCCGAAGCGAGCCGAAGCATCGCGACAGTTTTCAG 
AGACAGCTGATCGGTTGGAGCTGTTGCGCCGAGCAGTCATGGCGGCGGCCAGAGCTACTA 
CGCCGGCCGATGGCGAGGAGCCCGCCCCGGAGGCTGAGGCTCTGGCCGCAGCCCGGGAGC 
GGAGCAGCCGCTTCTTGAGCGGCCTGGAGCTGGTGAAGCAGGGTGCCGAGGCGCGCGTGT 
TCCGTGGCCGCTTCCAGGGCCGCGCGGCGGTGATCAAGCACCGCTTCCCCAAGGGCTACC 
GGCACCCGGCGCTGGAGGCGCGGCTTGGCAGACGGCGGACGGTGCAGGAGGCCCGGGCGC 
TCCTCCGCTGTCGCCGCGCTGGAATATCTGCCCCAGTTGTCTTTTTTGTGGACTATGCTT 
CCAACTGCTTATATATGGAAGAAATTGAAGGCTCAGTGACTGTTCGAGATTATATTCAGT 
CCACTATGGAGACTGAAAAAACTCCCCAGGGTCTCTCCAACTTAGCCAAGACAATTGGGC
AGGTTTTGGCTCGAATGCACGATGAAGACCTCATTCATGGTGATCTCACCACCTCCAACA

TGCTCCTGAAACCCCCCCTGGAACAGCTGAACATTGTGCTCATAGACTTTGGGCTGAGTT 

TCATTTCAGCACTTCCAGAGGATAAGGGAGTAGACCTCTATGTCCTGGAGAAGGCCTTCC 

TCAGTACCCATCCCAACACTGAAACTGTGTTTGAAGCCTTTCTGAAGAGCTACTCCACCT 

CCTCCAAAAAGGCCAGGCCAGTGCTAAAAAAATTAGATGAAGTGCGCCTGAGAGGAAGAA 

AGAGGTCCATGGTTGGGTAGAAGAATGTGTATGACAACCACACACAGTGAAGCTCTTTTT 

TCAAAGTAAATTTGAAGAAATGCTACAAGTATGAGATGAGATCTAAGTAAAGGTGTTAAG 

ATATTTTTAAGTGGTATGTGATCGTGTCATTATCATCTGCACTTCACTGAAGAGCTTACT 

ATGTGTCTAAGTCATGTTCTAGGCAGAATTGGGTATTTAAAGTAAATTGAGGACAGGCTT 

CTCCCAGATTGTGACATGTATATCTCAGATACATGGGTGTGGCATTGAACCACATAATGA 

GAACATTATTCTCTTTTTAGTCCTTGTGAGACAAGGATGAAGTCTCAGTTGCTGATACTC 

GCTGAGCTTACTGGCCCTCTAACCCAGTGTTTTTTTTTGTTGTTGTTGTGTACATGTTAT 

ATTTATTTTGAAACCAGTTTAATGGGATACAACCAGCATTTTAAAAAATGAAATAGAATA 

CAGGATGGT^AAATATCAGTGTATTGTTTTATGAAACTTTCACGTGTATATATAGACCAAG 

GATATGTGCTGAGTTTTGATGTCAAATATATTTCTGTTTCAGGGTCATGATGAAAAAATG 

AAAAGTCTGCTTAACTCCAATTTCTCTTTTAAAAAAGCAGACTTACAGCTTTCAGGCAAC 

TGAAATTCATGTTAACATGTTTTTATTTTTATTGCTTTGTATTTTTGTGGTTACGTTGTA 

AGACAAGTGATTGATCTAAAGTTCCTTTTAAGTTTATACCGCTAAACAAACTGAGTTGAT 

TTCTATCACAGGCAGTAAGTAGGTAGAGCAAAAATGGTGAAGTGACTTGTGAAGACTGAA 

GTTTGATGAAGTCTGGTTTAAGGCACAGGTAAACTGAGTGTGGATGCAAAAGTACCAGGA 

GCTAGCTTTTAACGTTGCCCAGGCTCAGTTTCTTTTCTTAGAAGAAGCTATGTTTGGGTG 

GGAAGGGAAGAGAGGGATAAGAAAATAGGTTTCTTCCTTGTAAACTCCAATCAACAAACA 

TATTTTGAGTGCCTTTTGTGTTCCTTGGCACCGTGTTGGGTATTGGGTACTTGGCACCCT 

GTTGGGTATTGGGTACAATGGTGAGCCAGACAGACACAGCGGCTGTCCTTTTGTAAGAAT 

ATTTATTTTTATAAAAAAGTATAAAGTATACAGTGGGATGTTTTGATATACATTATGAAA

TGATTGCTACAGCTGAGCTAATTAACACCCATCACCTCACATAGTTACTGTCTTGTTTCT 

TAATATGGACATTTGCAGCTATGAATTTCCCTCTGCACACTGTTGTCATCACACACTCTC 

AGTTTTGGTATTTTGTGTTTTTGTTTTCATTCATCTCAAAGTATTTTCTAATTTCCCTTG 

TGATTTCTTCTTTGACCCCTTGATTGTTTAGAAATCTGTTAATTTCCACACATTTGTAAA 

TGTTCCAATTTTTCTTTTGTTATTGCCAGCTTCATTCCATTGTGTTCAGAGATGATACAG 

TCAGTGCCTGTTCTTATGAAGCAAACATTCTATAATAGTAGGACCAGTACCCTGTCTGTT 

TCATTCACCACAGTCAGCATGCCCCAAGTGCCCAGCATGGGGCGGATGGCCAGGAATGAG 

TGAAAACTTCCCTTCCTGGGTAGTTGTGACTAGTAGAGAGGAAAAATAATATAATTGCCT 

GCTTACTGCATGCCAGGCATTGGGCTGGGAATTTTTATATTGGATCTAAAATAACTCTTA 

AGTTAGGCATTATCCCCATTTTATAGATGGAGAAACTGGCCCCAAAAGGTGGGAACTTGT 

CCAAGACGTCACAGGTAGCAAGAGGTACTTTTACCTGGCTCCAAATCTGTGTTCTTTCCA 

CTGACAAATGAGATATGGGATATGGTGCATCTTTACAGTACTATAATAAGTATTGGCGTA 

TAACATTATTTTCAAGGAACTCCAAGGGCCACAGGAGCTGACAGGTTTTTCAATTAATAT 

TCCCAACATGAATGAGATGCCTCATTCCTCAGTTTCCTCACGTGTACTATAAGGCTAGTA 

CCTGCTTTGTTGGGGTATGGTTGGCTCGTGTGCATTAAGTCAACAAATCCCTAGT 

SEQ ID NO: 75_AA013524_M

CTGGTGCAGCAGGGCGCCGAGGCGCGCGTTTTCCGTGGCCGCTTCCAGGGCCGCGCGGCC 
GTGGTGAAGCACCGCTTCCCGAAGAGTTACCGGCACCCGGAGCTGGAGGCGCGGCTCGGC 
CGTCGGCGGACGGTGCAGGAGGCGCGCGCGCTGCTCCGCTGCCGCCGTGCGGGGATAGCT 
GCCCCAGTCGTCTTCTTTGTGGACTATGCGTCTAACTGCTTATATATGGAAGAAATCGAA
GACTCGGTGACTGTTCGGGATTATATCCAATCCACTATGGAGACTGAAAAGGACCCCCAG 
TGCCTCTTGGACCTGGCCAGGAGGATGGGGCAGGTTCTGGCCGGAATGCACGACCAAGAC 
CTCATTCACGGGGACCTCACCACCTCCAACATGCTCCTGAGGCGGCCCCTGGCGCAGCTG 
CACATCGTGCTCATCGACTTTGGGCTGAGCTTTGTCTCAGGACTGCCGGAAGATAAAGGC 
GTCGACCTCTATGTCCTGGAGAAGGCCTTCCTCAGCACGCACCCCCACACCGAGACCGCG
TTTGAAGCCTTTCTGAAGAGTTACGGGGCCTCGTCCAAGAAGTCCAGTCCAGTGCTGAAG 
AAGTTAGATGAGGTGCGCCTGAGAGGGCGAAAGCGGTCCATGGTCGGGTAGTGGAGCTG'i' 
GGTGAACTGGCTCACGGTGAAGGATGATGTAGACGAGGCTGGACCCCTCAGCAAAGCATG 
GGTTGTTAAGTGGTCTGTGATCGTGCTGGGCCACCACCATCCATGGCTCACTGTTCTCAi; 
GGGCTTCATGTAGATGAGGTTTATTCTGGGCAGAACTGGGTAGGTAGCCGAGGCTAGGCT 
TGAATTTATGGCAACATCCTACCTCAGCTTGCTTGGAAGAGGTTATAAGCCACCATACCT 
GACTTTGCACTGATTCTGTCAGAAAC 

SEQ ID NO: 76_17000139801191_H, IRAKM_H

ATGGCGGGGAACTGTGGGGCCCGCGGCGCGCTGTCGGCGCACACGCTGCTGTTCGACCTG 
CCGCCCGCGCTGCTCGGAGAGCTCTGCGCTGTTCTGGACAGGTGCGACGGCGCGGTGGGG 
TGGCGCGGCCTGGCAGAGAGACTTTCAAGCAGCTGGCTGGATGTTCGTCATATTGAAAAG 
TATGTAGACCAAGGTAAAAGTGGAACAAGAGAATTACTTTGGTCCTGGGCACAGAAAAAC 
AAGACCATCGGTGACCTTTTACAGGTCCTCCAGGAGATGGGACATCGTCGAGCTATTCAT 
TTAATTACAAACTATGGAGCAGTGTTGAGTCCTTCAGAGAAGAGTTATCAGGAAGGTGGA
TTTGCAAATATATTATTCAAGGAAACAGCCAATGTCACCGTGGATAATGTTCTTATTCCT 
GAACATAATGAAAAAGGAGTACTGCTTAAATCTTCCATCAGCTTTCAAAATATCATAGAA 
GGAACTAGAAATTTCCACAAAGACTTCCTAATTGGAGAAGGAGAGATTTTTGAGGTATAC 
AGAGTGGAGATTCAAAACCTAAGATATGCTGTCAAATTATTTAAACAGGAGAAAAAAATG
CAGTGTAAGAAGCATTGGAAGAGGTTTTTATCTGAGCTTGAAGTTTTACTACTGTTTCAT 
CACCCAAACATACTAGAGTTGGCTGCATATTTTACAGAGACTGAGAAGTTCTGTCTGATT 
TATCCATACATGAGAAATGGAACACTTTTTGAGAGATTGCAGTGTGTAGGTGACACGGCG 
CCAGTCCCTTGGCACATTCGAATCGGTATATTAATAGGAATATGCAAAGCCATTCACTAC
CTGCACAACGTTCAACCATGCTCGGTCATCTGTGGCAGTATATCAAGTGCAAACATCCTT 
TTGGATGATCAGTTTCAACCCAAACTAACTGATTTTGCCATGGCACACTTCCGGTCCCAG 
CTAGAACATCAGAGTTGTACCATAAATATGACCAGCAGCAGCAGTAAACATCTGTGGTAC 
ATGCCAGAAGAGTACATCAGACAGGGGAAACTTTCCATTAAAACAGATGTCTACAGCTTT 
GGAATTGTAATAATGGAAGTTCTAACAGGATGTAGAGTAGTGTTAGATGATCCAAAACAT 
ATCCAGCTGCGGGATCTCCTTAGAGAATTGATGGAGAAGAGAGGCCTGGATTCATGTCTC 
TCATTTCTAGATAAGAAAGTGCCTCCCTGCCCTCGGAATTTCTCTGCCAAGCTCTTCTGT 
TTGGCAGGCCGGTGTGCTGCAACGCGGGCAAAGTTAAGACCATCAATGGATGAAGTTTTA 
AATACTCTTGAAAGTACTCAAGCCAGCTTGTATTTTGCTGAAGATCCTCCCACATCACTA 
AAGTCCTTCAGGTGTCCTTCTCCTCTATTCGTGGAGAATGTACCAAGTATTCCAGTGGAA 
GATGATGAAAGCCAGAATAACAATTTACTACCTTCTGATGAAGGCCTGAGGATAGACAGA 
ATGACTCAGAAAACTCCTTTTGAATGCAGCCAGTCTGAGGTTATGTTTGTGAGGTTGGAC 
AAAAAGCCAGAGAGCAAGAGAAATGAGGAAGCTTGCAACATGCCCAGTTCTTCTTGTGAA 
GAAAGTTGGTTCCCAAAGTATATAGTTCCATCCCAGGACTTAAGGCCCTATAAGGTAAAT 
ATAGATCCTTCTTCAGAAGCTCCAGGGCATTCTTGCAGGAGCAGGCCAGTGGAGAGCAGG 
TGTTCGTGCAAATTTTCCTGGGATGAATATGAACAGTACAAAAAAGAATAA 



SEQ ID NO: 77_AA840598_M, IRAKM_M

ATGTGGAAGAGATTTTTATCAGAACTGGAAGTTCTACTCCTGTTCCGTCACCCCCACATA
CTAGAGCTGGCTGCATATTTCACGGAGACTGAGAAACTTTGTCTGGTTTATCCCTATATG 
AGCAACGGGACGCTTTTCGACAGATTACAGTGCACAAATGGCACAACCCCGCTTTCCTGG 
CACGTTCGAATCAGCGTATTGATAGGAATAGGCAAAGCCATCCAATACTTGCACAACACT 
CAGCCGTGCGCCGTCATCTGTGGCAACGTTTCCAGTGCAAACATACTCTTGi jATGACCA' ; 
CTCCAACCCAAACTAAC(jGATTTTGCTGCAGCGCACTTCCGACCCAATCTA('3AGCAGCA(J 
AGTTCTACCATAAATATGACCGGCGGTGGCAGGAAACATCTGTGGTACATGCCAGAAGA/'. 
TACATCAGACAGGGAAGACTTTCCGTTAAAACTGATGTCTACAGCTTCGGAATCGTGATC
ATGGAGGTTCTAAGGGGCTGCAAAGTGGTGCTGGATGACCCGAAACACGTTCAGGTGCGG 
GACCTCCTCATGGAACTGATGGAGAAAAGAGGCCTAGACTCCTGCCTGTCCTTCTTAGAC 
AGGAAGATACCACCCTGTGCTCGGAACTTCTCTGCAAAGCTCTTCTCTCTGGCGGGCCGG 
TGTGTGGCAACGAAGGCCAAGTTAAGACCCACGATGGACGAAGTCCTGTCCTCTCTGGAG 
AGCACCCAGCCTAGCTTGTATTTTGCAGAAGACCGTCCCACGTCCTTGAAGTCCTTCAGG 
TGTCCTTCTCCACTGTTCTTGGATAATGTCCCAAGTATTCCAGTAGAAGATGATGAAAAC 
CAGAATAACCATTCAGTACCTCCCAAGGAAGTTTTGGGGACAGATAGAGTGACTCAGAAA 
ACCCCCTTTGAATGCAGCCAGTCTGAGGTCACCTTTGTAGGCTTGGACCGAAACAGAGGG 
AACAGGGGAAGTGAAGGGGATTGCAACGTGCCCAGTTCTTCTCATGAGGAATGCTGGTCC 
CCAGAGCTTGTGGCGCCATCCCAGGACTTAAGTCCTACTGTGATCAGTTTGGGCTCGTCT 
TGGGAAGTACCAGGCCATTCTTATGGGAGCAAGCCAATGGAGAAGAGGTGTTCCTCTGGG 
CTCTTTTGCAGTGAGCATGAACAGTCCAAAAAGCAGTGAATCCACCAGAAGATCAAGCAA 
AAAATAAAAGCAAACGTCACTGAAGGCACTGAGCAAATAGCATCCCCGTGAAAAGACACG 
AGCTCTGAGCTCCGTGAGTACAGCCAAGGGACCAACTGATGGAGAATTTGAATGGTGCAG 
ATTAGCAGCAAGGAAGTCTATTCCTTCCTCCAAACAGAATAATTTCAAGAGATGCTTTAT 
TCAAGTGACCGCCTCTCAGTCAAACCTGAGAAGCTAAACTGGAGCCAATCAGAATTATCC 
AAGATTCCGGGTTCTGACAACCAAAACCTAGCAAAGAGTAGCAGGACAAGTCTCTCTCTT 
AAGTCTCTCACTCTCTCTCATCATCCGAGTGAGATCTTGGTATAGGTGAACAGAGAACCA 
CCCACCTTCCAGAACCAGAACCACCTTCTCCCCAAGCCAGCAGTCAGTCACTCACCATCA 
GCAGCCAGTAGTCACCAGCAGCCAATCATGATACAGTGTCACTCTCCCTCTGCGCATGCC 
CTACGTCCTTTATAAAACCCAGGTCTTCAGGGCCCACCCCTTTCTTTTTCCATCCTTGCT 
CAGAGGCAGCCTTTTGTATACATTCCCTGACCCCACCCCAATTATATCTCTCATATGATA 
TCTGTTGCGTAGTGTGACTTTGTGGCATGACTTGGTTGTCAGATCATTTGCACAAGAACA 
AGCGAATACACAACAACAAAGCCCACCATCATTACCACCGGCACTTAATGCTAGTCTTTC 
TGCTAGGGATACTGACAGTCTATTTGCTTCCCATGGTCATAGGGAAGTTGCTCAAATGCA 
AAGGTTGTAGGGAATGTCTAATTTGTAAATGGCGTCGGGTGCCTTTGGAAGGAATTGTGT 
TTTTACAGCCAGTTGCTACTCTTGTTTATCGCTGGTTAACCGGTCTGTCCGGAAGTGAGC 
CAAGTCATCCTTGCTAGGGCTTTTTCTGTGTAGAGAGGGAATTCCAGTCCAAAGTCTGCT 
TCTCTGTATTTAAATTCTTAGAAGAGTTGCCTGTGGCATTCCAATTGTTATATAAAAAAA 

TTATATTAAAGAATTCCAGCACT

SEQ ID NO: 78_AA088547_H

ATGGCGAGTGCGGTCAGGGGGTCGAGGCCGTGGCCCCGGCTGGGGCTCCAGCTCCAGTTC 
GCGGCGCTGCTGCTCGGGACGCTGAGTCCACAGGTTCATACTCTCAGGCCAGAGAACCTC 
CTGCTGGTGTCCACCTTGGATGGAAGTCTCCACGCACTAAGCAAGCAGACAGGGGACCTG 
AAGTGGACTCTGAGGGATGATCCCGTCATCGAAGGACCAATGTACGTCACAGAAATGGCC 
TTTCTCTCTGACCCAGCAGATGGCAGCCTGTACATCTTGGGGACCCAAAAACAACAGGGA
TTAATGAAACTGCCATTCACCATCCCTGAGCTGGTTCATGCCTCTCCCTGCCGCAGCTCT 
GATGGGGTCTTCTACACAGGCCGGAAGCAGGATGCCTGGTTTGTGGTGGACCCTGAGTCA 
GGGGAGACCCAGATGACACTGACCACAGAGGGTCCCTCCACCCCCCGCCTCTACATTGGC 
CGAACACAGTATACGGTCACCATGCATGACCCAAGAGCCCCAGCCCTGCGCTGGAACACC 
ACCTACCGCCGCTACTCAGCGCCCCCCATGGATGGCTCACCTGGGAAATACATGAGCCAC 
CTGGCGTCCTGCGGGATGGGCCTGCTGCTCACTGTGGACCCAGGAAGCGGGACGGTGCTG 
TGGACACAGGACCTGGGCGTGCCTGTGATGGGCGTCTACACCTGGCACCAGGACGGCCTG 
CGCCAGCTGCCGCATCTCACGCTGGCTCGAGACACTCTGCATTTCCTCGCCCTCCGCTGG 
GGCCACATCCGACTGCCTGCCTCAGGCCCCCGGGACACAGCCACCCTCTTCTCTACCTTG 
GACACCCAGCTGCTAATGACGCTGTATGTGGGGAAGGATGAAACTGGCTTCTATGTCTCT 
AAAGCACTGGTCCACACAGGAGTGGCCCTGGTGCCTCGTGGACTGACCCTGGCCCCCGCA 
GATGGCCCCACCACAGATGAGGTGACACTCCAAGTCTCAGGAGAGCGAGAGGGCTCACCC 
AGCACTGCTGTTAGATACCCCTCAGGCAGTGTGGCCCTCCCAAGCCAGTGGCTGCTCATT
GGACACGACGAGGTACGCCGAGTCGTGCACACCACCATGGTGAGGGTCCATCGGACCCTG
GGGAGTGGAACTGCAGAGACAAGACCTCCAGAGAATACCCAGGCCCCAGCCTTCTTCTTG 
GAGCTATTGAGCCTGAGCCGAGAGAAACTTTGGGACTCCGAGCTGCATCCAGAAGAAAAA
ACTCCAGACTCTTACTTGGGGCTGGGACCCCAAGACCTGCTGGCAGCTAGCCTCACTGCT 
GTCCTCCTGGGAGGGTGGATTCTCTTTGTGATGAGGCAGGTGGTGGAGAAGCAGCAGGAG 
ACCCCCCTGGCACCTGCAGACTTTGCTCACATCTCCCAGGATGCCCAGTCCCTGCACTCG 
GGGGCCAGCCGGAGGAGCCAGAAGAGGCTTCAGAGTCCCTCAAAGCAAGCCCAGCCACTC 
GACGACCCTGAAGCTGAGCAACTCACCGTAGTGGGGAAGATTTCCTTCAATCCCAAGGAC 
GTGCTGGGCCGCGGGGCAGGCGGGACTTTCGTTTTCCGGGGACAGTTTGAGGGACGGGCA 
GTGGCTGTCAAGCGGCTCCTCCGCGAGTGCTTTGGCCTGGTTCGGCGGGAAGTTCAACTG 
CTGCAGGAGTCTGACAGGCACCCCAACGTGCTCCGCTACTTCTGCACCGAGCGGGGACCC 
CAGTTCCACTACATTGCCCTGGAGCTCTGCCGGGCCTCCTTGCAGGAGTACGTAGAAAAC 
CCGGACCTGGATCGCGGGGGTCTGGAGCCCGAGGTCGTGCTGCAGCAGCTGATGTCTGGC 
CTGGCCCACCTGCACTCTTTACACATAGTGCACCGGGACCTGAAGCCAGGAAATATTCTC 
ATCACCGGGCCTGACAGCCAGGGCCTGGGCAGAGTGGTGCTCTCAGACTTCGGCCTCTGC 
AAGAAGCTGCCTGCTGGCCGCTGTAGCTTCAGCCTCCACTCCGGCATCCCCGGCACGGAA 
GGCTGGATGGCGCCCGAGCTTCTGCAGCTCCTGCCACCAGACAGTCCTACCAGCGCTGTG 
GACATCTTCTCTGCAGGCTGCGTGTTCTACTACGTGCTTTCTGGTGGCAGCCACCCCTTT 
GGAGACAGTCTTTATCGCCAGGCAAACATCCTCACAGGGGCTCCCTGTCTGGCTCACCTG 
GAGGAAGAGGTCCACGACAAGGTGGTTGCCCGGGACCTGGTTGGAGCCATGTTGAGCCCA 
CTGCCGCAGCCACGCCCCTCTGCCCCCCAGGTGCTGGCCCACCCCTTCTTTTGGAGCAGA 
GCCAAGCAACTCCAGTTCTTCCAGGACGTCAGTGACTGGCTGGAGAAGGAGTCCGAGCAG 
GAGCCCCTGGTGAGGGCACTGGAGGCGGGAGGCTGCGCAGTGGTCCGGGACAACTGGCAC 
GAGCACATCTCCATGCCGCTGCAGACAGATCTGAGAAAGTTCCGGTCCTATAAGGGGACA 
TCAGTGCGAGACCTGCTCCGTGCTGTGAGGAACAAGAAGCACCACTACAGGGAGCTCCCA 
GTTGAGGTGCGACAGGCACTCGGCCAAGTCCCTGATGGCTTCGTCCAGTACTTCACAAAC 
CGCTTCCCACGGCTGCTCCTCCACACGCACCGAGCCATGAGGAGCTGCGCCTCTGAGAGC 
CTCTTCCTGCCCTACTACCCGCCAGACTCAGAGGCCAGGAGGCCATGCCCTGGGGCCACA 
GGGAGGTGA 

SEQ ID NO: 79_HGP_6644466

GGAGGGTTCGAATTGCAACGGCAGCTGCCGGGCGTATGTGTTGGTGCTAGAGGCAGCTGC 
AGGGTCTCGCTGGGGGCCGCTCGGGACCAATTTTGAAGAGGTACTTGGCCACGACTTATT 
TTCACCTCCGACCTTTCCTTCCAGGCGGTGAGACTCTGGACTGAGAGTGGCTTTCACAAT 
GGAAGGGATCAGTAATTTCAAGACACCAAGCAAATTATCAGAAAAAAAGAAATCTGTATT 
ATGTTCAACTCCAACTATAAATATCCCGGCCTCTCCGTTTATGCAGAAGCTTGGCTTTGG 
TACTGGGGTAAATGTGTACCTAATGAAAAGATCTCCAAGAGGTTTGTCTCATTCTCCTTG
GGCTGTAAAAAAGATTAATCCTATATGTAATGATCATTATCGAAGTGTGTATCAAAAGAG 
ACTAATGGATGAAGCTAAGATTTTGAAAAGCCTTCATCATCCAAACATTGTTGGTTATCG 
TGCTTTTACTGAAGCCAATGATGGCAGTCTGTGTCTTGCTATGGAATATGGAGGTGAAAA 
GTCTCTAAATGACTTAATAGAAGAACGATATAAAGCCAGCCAAGATCCTTTTCCAGCAGC 
CATAATTTTAAAAGTTGCTTTGAATATGGCAAGAGGGTTAAAGTATCTGCACCAAGAAAA
GAAACTGCTTCATGGAGACATAAAGTCTTCAAATGTTGTAATTAAAGGCGATTTTGAAAC 
AATTAAAATCTGTGATGTAGGAGTCTCTCTACCACTGGATGAAAATATGACTGTGACTGA 
CCCTGAGGCTTGTTACATTGGCACAGAGCCATGGAAACCCAAAGAAGCTGTGGAGGAGAA
TGGTGTTATTACTGACAAGGCAGACATATTTGCCTTTGGCCTTACTTTGTGGGAAATGAT
GACTTTATCGATTCCACACATTAATCTTTCAAATGATGATGATGATGAAGATAAAACTTT 
TGATGAAAGTGATTTTGATGATGAAGCATACTATGCAGCGTTGGGAACTAGGGCACCTAT 
TAATATGGAAGAACTGGATGAATCATACCAGAAAGTAATTGAACTCTTCTCTGTATGCAC 
TAATGAAGACCCTAAAGATCGTCCTTCTGCTGCACACATTGTTGAAGCTCTGGAAACAGA

TGTCTAGTGATCATCTCAGCTGAAGTGTGGCTTGCGTAAATAACTGTTTATTCCAAAATA 

TTTACATAGTTACTATCAGTAGTTATTAGACTCTAAAATTGGCATATTTGAGGACCATAG 

TTTCTTGTTAACATATGGATAACTATTTCTAATATGAAATATGCTTATATTGGCTATAAG 

CACTTGGAATTGTACTGGGTTTTCTGTAAAGTTTTAGAAACTAGCTACATAAGTACTTTG 

ATACTGCTCATGCTGACTTAAAACACTAGCAGTAAAACGCTGTAAACTGTAACATTAAAT 

TGAATGACCATTACTTTTATTAATGATCTTTCTTAAATATTCTATATTTTAATGGATCTA 

CTGACATTAGCACTTTGTACAGTACAAAATAAAGTCTACATTTGTTTAAAACACTGAACC 

TTTTGCTGATGTGTTTATCAAATGATAACTGGAAGCTGAGGAGAATATGCCTCAAAAAGA

GTAGGTGCTTGGATACTTCAGACTCTGGTTACAGATTGTCTTGATCTCTTGGATGTCCTC 

AGATCTTTGGTTTTTGGTTTAATTTATTAAATGTATTTTCCATACTGAGTTTAAAATTTA 

TTAATTTGTAGGTTAAGCATTTCCCAGCTGTGTAAAAACAATAAAACTCAAATAGGATGA 

TAAAGAATAAAGGACACTTTGGGTACCAGAAGGTGTCTGAGCATTATTTTATACTTG 

SEQ ID NO: 80_AA449542_M

ATCTCCAAGAGGGTTGTCTCATTCTCCTTGGGCCGTGAAAAAGATAAGTCTTTTATGCGA 

TGATCATTATCGAACTGTGTATCAGAAGAGAGTAAGTGATGAAGCTAAGATTTTAAAAAA

CCTTAATCACCCAAACATTATAGGATATCGTGCTTTTACTGAAGCCAGTGATGGTAGTGT 

GTGCCTTGCTATGGAGTATGGAGGTGAAAAGTCTCTGAATGACTTAATAGAAGAGCGGAA 

CAAAGACAGTGGAAGTCCTTTTCCAGCAGCTGTAATTCTCAGAGTTGCTTTGCACATGGC 

CAGAGGGCTAAAGTACCTGCACCAAGAAAAGAAGCTGCTTCATGGAGACATAAAGTCTTC 

AAATGTTGTAATTAAAGGTGATTTTGAAACAATTAAAATCTGTGATGTAGGAGTCTGTCT 

GCCATTGGATGAAAATATGACTGTGACTGATCCTGAGGCCTGTTATATTGGTACTGAGCC 

ATGGAAACCCAAGGAAGCGTTGGAAGAAAATGGCATCATTACTGAGAAGGCAGATGTGTT 

TGCTTTTGGGCTTACTCTGTGGGAAATGATGACTTTATGTATTCCACACGTCAATCTTCC 

AGATGATGATGTTGATGAAGATGCAACCTTTGATGAGAGTGACTTCGATGATGAAGCATA 

TTATGCAGCTCTGGGGACAAGGGCATGCATCAACATGGAAGAGCTGGATGACTCCTAGCA 

GAAGGGCATTGAACTCTTCTGTGTGTGGAGTAATGAGGATGCTAAAGATCGCCCGTCTGC 

TGCACACATCGTTGAAGCTTTGGAAGTAGATGGGCAATGTTGTGGTCTAAGCTCAAAGCA 

TTAACTTGTATGGGAAGTGTTAAGTAGATATATGTAGTTAATATAACTTATGGTAGCTAG 

ATTCTAGAAGTAGGTTTAAGACTAGTGACCCCTGTCTAAGATGACTTAAGAATCAAGGGA 

CCATTGCTTTGTTACAGATCTTTTTAGATATTCTTGGTTCTTTAGTGGGTTACTAAAAAT 

TTCACTACGTACATGTGGTACAGATATCTGTCTGGTCATAGTGTCAGTCCTTCAGCTGGC 

CTGTCAGCCCATGCGCCCTGGGACTTGAGAAGAGTTCATAAACGTAGCTCCTAGGGTGTG 

TTGGCTCTCTACAGTTAGCTTCTAATTTATTAGTTTGTTTCTACTGATTGTGTCTTAAGT 

GTTTTAAAATAAATGTAAGAATAAACAATAAAAGACAGTTTTAGTAGCAGG 

SEQ ID NO: 81_5R57_10_2_M TESK2_M

GGTGCTGGACAGTGACTTGTATTTACCGTGGAGTGTGAGAGTGAAAGTGGCCTATGGCAT 
AGCAGTGGGCCTCAGGTAGGTTCAGTTGAAAGGCATTTTCCATCGGGAGGTGAGATCAAA 
GGTGTGAAGGCTTTGGTTTC 

SEQ ID NO: 82_AA232253_H

ATGTCGTCTCTGGGTGCCTCCTTTGTGGAAATTAAATTTGATGAGTTGCAGTTTTTTGAA 
AACTGCGGTGGAGGAAGTTTTGGGAGTGTTTATCGAGCCAAATGGATATCACAGGACAAG 
GAGGTGGCTGTAAAGAAGGTCCTCAAAATAGAGAAAGAGGCAGAAATACTCAGTGTCCTC 
AGTCACAGAAACATCATCCAGTTTTATGGAGTAATTCTTGAACCTCCCAACTATGGCATT 
GTCACAGAATATGCTTCTCTGGGATCACTCTATGATTACATTAACAGTAACAGAAGTGAG 
GAGATGGATATGGATCACATTATGACCTGGGCCACTGATGTAGCCAAAGGAATGCATTAT 
TTACATATGGAGGCTCCTGTCAAGGTGATTCACAGAGACCTCAAGTCAAGAAACGTTGTT 
ATAGCTGCTGATGGAGTATTGAAGATCTGTGACTTTGGTGCCTCTCGGTTCCATAACCAT
ACAACACACATGTCCTTGGTTGGAACTTTCCCATGGATGGCTCCAGAAGTTATCCAGAG r r 
CTCCCTGTGTCAGAAACTTGTGACACATATTCCTATGGTGTGGTTCTCTGGGAGATGCTA
ACAAGGGAGGTCCCCTTTAAAGGTTTGGAAGGATTACAAGTAGCTTGGCTTGTAGTGGAA 
AAAAACGAGAGATTAACCATTCCAAGCAGTTGCCCCAGAAGTTTTGCTGAACTGTTACAT 
CAGTGTTGGGAAGCTGATGCCAAGAAACGGCCATCATTCAAGCAAATCATTTCAATCCTG 
GAGTCCATGTCAAATGACACGAGCCTTCCTGACAAGTGTAACTCATTCCTACAGAACAAG 
GCGGAGTGGAGGTGCGAAATTGAGGCAACTCTTGAGAGGCTAAAGAAACTAGAGCGTGAT 
CTCAGCTTTAAGGAGCAGGAGCTTAAAGAACGAGAAAGACGTTTAAAGATGTGGGAGCAA
AAGCTGACAGAGCAGTCCAACACCCCGCTGCTGCCTTCCTTTGAGATTGGTGCATGGACG 
GAAGACGATGTGTATTGGTGGGTTCAGCAGCTCGTCAGAAAAGGTGACTCTTCAGCAGAG 
ATGAGTGTATATGCAAGCTTGTTTAAAGAAAACAACATTACAGGGAAGCGGCTGCTGCTG 
CTGGAGGAAGAAGACCTGAAAGACATGGGCATTGTCTCCAAGGGGCATATCATTCACTTC 
AAGTCAGCCATTGAGAAATTAACCCATGATTACATAAATTTGTTTCACTTCCCACCACTA 
ATTAAGGACTCAGGAGGTGAACCTGAAGAAAATGAGGAAAAAATAGTGAACCTGGAACTG 
GTTTTTGGTTTTCACTTGAAACCAGGAACTGGCCCACAGGATTGTAAGTGGAAAATGTAT 
ATGGAGATGGATGGGGATGAAATTGCAATAACCTACATAAAAGATGTGACATTCAACACT 
AACCTACCTGATGCGGAGATTTTAAAGATGACAAAGCCACCATTTGTAATGGAGAAGTGG 
ATTGTAGGAATAGCAAAAAGTCAGACTGTGGAGTGCACTGTCACATATGAGAGTGATGTl 1 
AGAACTCCAAAAAGCACTAAACATGTCCATTTGATTCAGTGGAGTAGAACAAAACCTCAG 
GATGAAGTGAAAGCAGTCCAACTTGCCATTCAGACATTATTCACCAATTCAGATGGCAAC 
CCTGGAAGCAGGTCCGACTCAAGTGCTGATTGCCAGTGGTTAGATACTCTGAGGATGCGG 
CAGATTGCATCCAACACTTCTTTACAGCGTTCCCAGAGCAATCCTATTCTGGGGTCACCG 
TTCTTCTCACACTTTGATGGCCAGGATTCCTACGCTGCTGCTGTGAGACGGCCCCAGGTG 
CCCATTAAGTATCAACAGATTACACCTGTGAACCAGTCCAGAAGCTCGTCTCCTACTCAG 
TATGGACTGACCAAAAACTTCTCTTCCTTACATCTCAACTCTAGGGACAGTGGCTTTTCC 
AGTGGCAATACTGACACCTCTTCAGAGAGGGGTCGATACTCAGACAGAAGCAGGAACAAA 
TATGGACGTGGTAGTATATCACTCAATTCTTCTCCTAGAGGAAGATACAGTGGAAAGAGT 
CAGCATTCCACTCCATCAAGAGGAAGATACCCTGGAAAGTTCTACAGGGTTTCTCAGTCA 
GCACTCAATCCTCACCAGTCGCCTGACTTCAAGAGAAGCCCCAGGGACCTCCACCAACCC 
AACACCATACCAGGGATGCCTTTGCACCCTGAGACTGACTCAAGAGCCAGTGAAGAGGAC 
AGCAAAGTCAGCGAAGGGGGCTGGACAAAAGTGGAATACCGGAAAAAGCCCCACAGGCCA 
TCTCCCGCCAAAACCAATAAAGAGAGAGCCAGAGGGGACCACCGTGGATGGAGAAACTTT 

TGA 

SEQ ID NO: 83_AI375137_H 

ATGGGAAATTATAAATCTAGACCAACCCAAACTTGTACTGATGAATGGAAGAAAAAAGT'r 
AGTGAATCATATGTTATCACAATAGAAAGATTAGAAGATGACCTGCAGATCAAGGAAAAA 
GAACTGACAGAACTAAGGAATATATTTGGCTCTGATGAAGCCTTCAGTAAAGTCAATTTA 
AATTACCGCACTGAAAATGGGCTGTCTCTACTTCATTTATGTTGCATTTGTGGAGGCAAG 
AAATCACATATTCGAACTCTTATGTTGAAAGGGCTCCGCCCATCTCGACTGACAAGAAAT 
GGATTTACAGCCTTGCATTTAGCAGTTTACAAGGATAATGCAGAATTGATCACTTCTCTG 
CTTCACAGTGGAGCTGATATACAGCAGGTTGGATACGGTGGCCTCACTGCCCTCCATATT 
GCTACAATAGCTGGCCACCTAGAGGCTGCTGATGTGCTGTTGCAACATGGAGCTAATGTC 
AATATTCAAGATGCAGTTTTTTTCACTCCATTGCATATTGCAGCGTACTATGGACATGAA 
CAGGTAACTCGCCTTCTTTTGAAATTTGGTGCTGATGTAAATGTAAGTGGTGAAGTTGGA 
GATAGACCCCTCCACCTAGCATCTGCAAAAGGATTCTTGAATATTGCAAAACTCTTAATG
GAAGAAGGCAGCAAAGCAGATGTGAATGCTCAAGATAATGAAGACCATGTCCCACTCCAT 
TTCTGTTCTCGATTTGGACACCATGATATAGTTAAGTATGTGCTGCAAAGTGATTTGGAA
GTTCAACCTCATGTTGTTAJ^TATCTATGGAGATACCCCCTTACACCTGGCATGCT7iG/^/r 
GGCAAATTTGAAGTTGCCAAGGAAATCATCCAAATATCAGGAACAGAAAGTCTGACTAAG
GAAAACATCTTCAGTGAAACAGCTTTTCATAGTGGTTGTACGTATGGGAAGAGGATTGA^ 
CTAGTCAAATTTCTTCTTGATCAGAATGTCATAAACATCAACCACCAAGGAAGGGATGGG
CACACTGGATTACACTCTGCTTGCTACCACGGTCACATTCGCCTGGTTCAGTTCTTACTG 
GATAATGGAGCTGATATGAATCTAGTGGCTTGTGATCCCAGCAGGTCTAGTGGTGAAAAA
GATGAGCAGACATGTTTGATGTGGGCTTATGAAAAAGGGCATGATGCCATTGTCACACTC 
CTGAAGCATTATAAGAGACCACAAGATGAATTGCCCTGTAATGAATATTCTCAGCCTGGA 
GGAGATGGCTCCTATGTGTCTGTTCCATCACCCTTGGGGAAGATTAAAAGCATGACAAAA 
GAGAAGGCAGATATTCTCCTCCTAAGAGCTGGATTGCCTTCACATTTCCATCTTCAGCTC 
TCAGAAATTGAGTTCCATGAGATTATTGGCTCAGGTTCTTTTGGGAAAGTATATAAAGGA 
CGATGCAGAAATAAAATAGTGGCTATAAAACGTTATCGAGCCAATACCTACTGCTCCAAG 
TCAGATGTGGATATGTTTTGCCGAGAGGTGTCCATTCTCTGCCAGCTCAATCATCCCTGC 
GTAATTCAGTTTGTGGGTGCTTGCTTGAATGATCCCAGCCAGTTTGCCATTGTCACTCAA 
TACATATCAGGGGGTTCTCTGTTCTCCCTCCTTCATGAGCAGAAGAGGATTCTTGATTTG 
CAGTCTAAATTAATTATTGCAGTAGATGTTGCCAAAGGCATGGAGTACCTTCACAACCTG 
ACACAGCCAATTATACATCGTGACTTGAACAGTCACAATATTCTTCTCTATGAGGATGGG 
CATGCTGTGGTGGCAGATTTTGGAGAATGAAGATTTCTACAGTCTCTGGATGAAGACAAG 
ATGACAAAACAACCTGGGAACCTCCGTTGGATGGCTCCTGAGGTGTTCACGCAGTGCACT 
CGGTACACCATCAAAGCAGATGTCTTCAGCTATGCTCTGTGTCTGTGGGAAATTCTCACT 
GGCGAAATTCCATTCGCTCATCTCAAGCCAGCGGCTGCGGCAGCAGACATGGCTTACCAC 
CACATCAGACCTCCCATTGGCTATTCCATTCCCAAGCCCATATCATCTCTGCTGATACGA 
GGGTGGAACGCATGTCCTGAAGGAAGACCCGAATTTTCTGAAGTTGTCATGAAGTTAGAA 
GAGTGTCTCTGCAACATTGAGCTGATGTCTGCTGCATCAAGTAACAGGAGTGGGTGTCTC 
TCACCTTCTTCTTCTTCTGATTGCCTGGTGAAGCGGGGAGGACCTGGCCGGAGTCATGTG 
GCAGCATTAAGAAGTCGTTTCGAATTGGAATATGCTCTAAATGCAAGGTCCTATGCTGCT 
TTGTCCCAAAGTGCTGGACAATATTCCTCTCAAGGTCTGTCTTTGGAGGAGATGAAAAGA 
AGTCTTCAATACACACCCATTGACAAATATGGCTATGTATCCGATCCCATGAGCTCAATG 
CATTTTCATTCTTGCCGAAATAGTAGCAGCTTTGAGGAGAGCAGGTGA 

SEQ ID NO: 84_H97685_H

ATGATTTCTTGCCTGTNATAACCTATGCACTCACAAAGATGAACTCTCTGAGAGGGATGA 
GCAAGAGCTTCAGGAAATCCGAAAGTATTTCTCCTTTCCTGTATTCTTTTTCAAAGTGCC 
GAAACTGGGCTCGGAGATAATAGACTCCTCAACCAGGAGAATGGAGAGCGAAAGATCACC 
GCTTTATCGCCAGCTAATTGACCTGGGCTATCTGAGCAGCAGTCACTGGAACTGTGGGGC 
TCCTGGCCAGGATACTAAAGCTCAGAGCATGTTGGTGGAACAGAGTGAAAAGCTGAGACA
CTTGAGCACATTTTCTCACCAGGTGTTACAGACTCGCCTGGTGGATGCAGCCAAGGCCGT 
GAACCTGGTGCACTGCCACTGCCTTGACATCTTTATTAACCAGGCATTTGACATGCAGGG 
GGACCTGCAGATCACTCCCAAACGTCTGGAATATACTCGAAAAAAGGAGAATGAGTTGTA 
TGAATCATTGATGAATATTGCCAACCGAAAGCAGGAGGAAATGAAGGATATGATTGTTGA 
GACACTTAATACCATGAAGGAGGAACTTCTGGATGATGCTACTAACATGGAGTTTAAAGA 
CGTCATTGTCCCTGAGAATGGAGAACCAGTAGGCACCAGAGAGATCAAATGCTGCATCCG 
ACAGATCCAGGAACTCATCATCTCCCGACTTAATCAGGCAGTGGCTAATAAGCTGATCA'J 
CTCAGTGGATTACCTGAGGGAAAGCTTCGTCGGAACCCTGGAACGATGTCTGCAGAGCCT 
GGAGAAGTCTCAGGATGTCTCAGTTCACATCACCAGTAATTATCTCAAACAGATCTTAAA 
TGCTGCCTATCATGTTGAAGTCACGTTTCACTCAGGGTCGTCAGTTACAAGGATGCTATG 
GGAGCAAATCAAACAGATCATCCAGCGCATCACATGGGTGAGCCCACCTGCCATCACTCT 
GGAATGGAAGAGGAAGGTGGCCCAGGAAGCCATTGAGAGCCTCAGCGCCTCCAAATTGGG 
TAAGAGCATTTGCAGCCT^ATTCCGGACTCGGCTCAATAGTTCCCACGAGGCTTTTGCAG r 
GTCCTTGCGGCAGGTGGAAGCTGGCCACTCAGGCCGGTTAGAGAAAACGGAAGATGTATG 
GCTGAGGGTTCGGAJ^GATCATGCTCCCCGCCTGGCCCGCCTTTCTCTGGAAAGCCGTTG 
TTTACAGGATGTCTTGCTTCATCGTAAACCTAAACTGGGACAGGAACTGGGCCGGGGCCA

GTATGGTGTGGTATACCTGTGTGACAACTGGGGAGGACACTTCCCTTGTGCCCTCAAATC 

AGTTGTCCCTCCAGATGAGAAGCACTGGAATGATCTGGCTTTGGAATTTCACTATATGAG 

GTCTCTGCCGAAGCATGAGCGATTGGTGGATCTCCATGGTTCAGTCATTGACTACAACTA 

TGGTGGTGGCTCCAGCATTGCTGTGCTCCTCATTATGGAGCGGCTACACCGGGATCTCTA 

CACAGGGCTGAAGGCTGGGCTGACCCTGGAGACACGTTTGCAGATAGCACTAGATGTGGT 

GGAGGGAATCCGCTTCCTGCACAGCCAGGGACTTGTCCATCGTGATATCAAACTGAAAAA 

TGTGCTGCTGGATAAGCAGAACCGTGCCAAGATCACTGACTTAGGATTCTGCAAGCCAGA 

GGCCATGATGTCAGGCAGCATTGTGGGGACACCAATCCATATGGCCCCTGAACTTTTCAC 

AGGGAAGTAGGATAATTCGGTGGATGTGTACGCTTTTGGAATTCTTTTCTGGTATATCTG 

CTGAGGCTGTGTCAAGCTCCCTGAGGCATTTGAGAGGTGTGGTAGCAAAGACCATGTCTG 

GAACAATGTGCGGAGGGGGGCTCGCCCAGAACGTCTTCCTGTGTTTGATGAGGAGTGCTG 

GCAGTTGATGGAAGCCTGTTGGGATGGCGACCCCTTGAAGAGGCCTCTCTTGGGCATTGT 

CCAGCCCATGCTCCAGGGCATCATGAATCGGGTCTGGAAGTCCAATTCTGAGCAGCCAAA 

CAGAGGACTAGATGATTGTAGTTGAAAGCAAAGACCTTTCTGTTTCACTCTCTAGTTATT 

TGCTTCCCCCTCACCATTTGGCCATGGGGAGAATTTGACATTTATTCAGTATAGGACACA 

CTGGCAAGGGAACTGGTGCTTGCTGGGAAAGTTGGAACGTTCCCAGGCAGGGATGACTCC 

TGGACAGTGAAGAGTTGAATGACTGAGCATATTCAGCAGCTCAGTGAAGGGGCAAGCTAT 

CCCTTTAGCAAAAAAGTGTCTCAGATGTGTAAAAGCTGAGGAATGTGGTGTTCTGGCTTG 

ACAAATGAAAAGGAGGCAGATGTT 

SEQ ID NO: 85_W20810_M

TTGATGTCAACCTGAAGGCTTCTAAAGCGAGTGATGTCTAGAGCTTTGGGATCCTCGTGT 
GGGCAGTGCTGGCTGGCAGAGAAGCTGAGTTGGTAGACAAGACTTCACTAATCCGGGAAA 
CAGTGTGTGACAGGCAGAGTCGTCCTCCACTGACAGAGCTGCCTCCAGGTAGCCCTGAGA 
CTCGCGGCTTGGAAAAACTGAAGGAGTTAATGATTCATTGCTGGGGTTCCCAGTGGGAAA 
ACAGGCCATCCTTCCAGGACTGCGAACCAAAAACCAATGAAGTTTACAATGTGGTAAAGG 
ACAAGGTAGATGCTGCTGTCTCCGAGGTAAAGCATTATGTGTGTCAGCACAGAAGCAGCG 
GCAGAAACTTGTGTGCCAGAGAGCCAAGCCAAAGAGGCACAGAAATGGATTGGCGGAGGG 
AAACCATGGTTTCTAAAATGCTGGACCGCCTGCATTTGGAGGAACCCTCCGGACCAGTTC 
CTGGAAAATGTCCTGAGAGGCAAGCACAGGACACATCAGTTGGGCCTGCCACACCAGCAA 
GGACATCTTCTGACCCCGTGGCTGGCACTCGTCAGATTGCACATACTTTACCCTTCAGAG 
GCACAACACCTGGGCCAGTCTTTACTGAGACTCCCGGTCCTCACCCCCAAAGGAATCAGG 
GAGATGGAAGACACGGCACTCCTTGGTATCCCTGGACCCCACCGAATCCAATGACAGGGC 
CACCGGCTGTCGTCTTCAACAAGTGTTCTGAAGTGCAGATTGGGAACTACAACTCCTTGG 
TAGCACCACCAAGAACTACTGCCTCAAGTTCGGCCAAGTATGACCAAGCACAGTTGGGCA
GGGGTAGGGGCTGGCAGCCCTTCCACAAGTAGACTTCAGAGAATCACTGCAAGAGCCTGA 
AGTGTGCCATTCAGCGTGGCAATAAAAAGCACGTTTTAAGCAACCTGGACTGGCTAAGAC 
AGTCCTTGCCACTTCCTGAAGCTCACAACATTCTGTGAGGACAGTTGGACCTACACCCAA 
ACTGACTCTTGACCCATCTCCTTAAAGTCAATAAACATAGCATGTTAACTGTG 

SEQ ID NO: 86_AA744236_H

ATGGGATCAGAGAACAGTGCTTTAAAGAGCTATACACTGAGAGAACCACCATTTACCTTA 
CCCTCTGGACTTGCTGTTTATCCCGCTGTACTGCAAGATGGCAAATTTGCTTCAGTTTTT 
GTGTATAAGAGAGAAAATGAAGACAAGGTTAATAAAGCTGCCAAGCATTTGAAGACACT1 1 
CGTCACCCTTGCTTGCTAAGATTTTTATCTTGTACTGTGGAAGCGGATGGCATTCATCl^T 
GTCACTGAGCGAGTACAGCCCCTGGAAGTGGCTTTGGAAACATTGTCTTCTGCAGAGGTC 
TGTGCTGGGATCTATGACATATTGCTGGCTCTTATCTTCCTTCATGACAGAGGACACCTA 
ACACACAATAATGTCTGTTTATCATCTGTGTTTGTGAGTGAAGATGGACACTGGAAGCTA 
GGAGGAATGGAAACTGTTTGTAAAGTTTCTCAGGCCACACCAGAGTTTCTGAGGAGTATT
CAGTCAATAAGAGACCCAGCATCTATCCCTCCTGAAGAGATGTCTCCAGAATTCACAACT

CTCCCAGAGTGTCATGGACATGCCCGGGATGCCTTTTCATTTGGAACATTGGTGGAAAG r r 

TTGCTCACAATCTTAAATGAACAGGTTTCAGCGGATGTTCTCTCCAGCTTTCAACAGACC 

TTGCACTCAACTTTGCTGAATCCCATTCCAAAATGTCGGCCAGCGCTCTGCACCTTACTA 

TCTCATGACTTCTTCAGAAATGATTTTCTGGAAGTTGTGAATTTCTTGAAAAGTTTAACA 

TTGAAGAGTGAAGAGGAGAAAACGGAATTCTTTAAATTTCTGCTGGACAGAGTCAGCTGC 

TTGTCAGAGGAATTGATAGCTTCAAGGTTGGTGCCTCTTCTGCTTAATCAGTTGGTGTTT 

GCAGAGCCAGTGGCTGTTAAGAGTTTTCTTCCTTATCTGCTTGGCCCCAAAAAAGATCAT 

GCGCAGGGAGAAACTCCTTGCTTGCTCTCACCAGCCCTGTTCCAGTCACGGGTGATCCCC 

GTGCTTCTCCAGTTGTTTGAAGTTCATGAAGAGCATGTGCGGATGGTGCTGCTGTCTCAC 

ATCGAGGCCTACGTGGAGCACTTCACTCAGGAGCAGCTGAAGAAAGTCATCTTGCCACAG 

GTTTTGCTGGGCCTGCGTGATACTAGCGATTCCATTGTGGCAATTACTCTGCATAGCCTA 

GCAGTGCTGGTCTCTCTGCTTGGACCAGAGGTGGTTGTGGGAGGAGAACGAACCAAGATC 

TTCAAACGCACTGCCCCAAGTTTTACTAAAAATACTGACCTTTCTCTAGAAGGCGATCCA 

TTTTCTCAGCCTATTAAATTTCCCATAAATGGACTCTCAGATGTAAAAAATACTTCGGAG 

GACAGTGAAAACTTCCCATCAAGTTCTAAAAAGTCTGAGGAGTGGCCTGAGTGGAGTGAA 

CCTGAGGAGCCTGAAAATCAAACTGTCAACATACAGATTTGGCCTAGAGAACCTTGTGAT 

GATGTCAAGTCCCAGTGCACTACCTTGGATGTGGAAGAGTCATGTTGGGATGAGTGCGAG 

CCCAGCAGCTTAGATACTAAAGTAAACCCAGGAGGTGGAATCACTGCTACAAAACCTGTT 

ACCTCAGGGGAGCAGAAGCCTATTCCTGCTTTGCTTTCACTCACTGAAGAGTCTATGCCT 

TGGAAATCAAGCTTACCCCAAAAGATTAGCCTTGTACAAAGGGGGGATGACGCAGACCAA 

ATCGAGCCGCCAAAAGTGTCATCACAAGAAAGGCCCCTTAAGGTTCCATCAGAACTTGGT 

TTAGGAGAGGAATTCACCATTCAAGTAAAAAAGAAGCCAGTAAAAGATCCTGAGATGGAT 

TGGTTTGCTGATATGATCCCAGAAATTAAGCCTTCTGCTGCTTTTCTTATATTACCTGAA 

CTGAGGACAGAAATGGTCCCAAAAAAGGATGATGTCTCCCCAGTGATGCAGTTTTCCTCA 

AAATTTGCTGCAGCAGAAATTACTGAGGGAGAGGCTGAAGGCTGGGAAGAAGAAGGGGAG 

CTGAACTGGGAAGATAATAACTGGTGA 

SEQ ID NO: 87_AI052250_H 

AGCGGCCGCGGGGGCGGCGGAGGATATGGAGTAAAGCCAGAGTCAGTGGCCAGGCACGAA 
GGCAGAGCAGGAACAGCCAGGAGGCGTTTATTAGGGGGGCGGGGGGAAAGAGCCCCAGCA 
CCGGCCCTCCTGGAAGAAGGAAGAGGTAACTATAAGTACCCAATATTGCAGCCATGGAGT
CCATGCTTAATAAATTGAAGAGTACTGTTACAAAAGTCACAGCTGATGTCACTAGTGCGG 
TAATGGGAATTCCTGTCACTAGAGAATTTGATGTTGGTCGACACATTGCCAGTGGTTGCA 
ATGGGCTAGCTTGGAAGATTTTTAATGGCACAAAAAAGTCAACAAAGCAGGAAGTGGCAG 
TTTTTGTCTTTGATAAAAAACTGATTGACAAGTATCAAAAATTTGAAAAGGATCAAATCA 
TTGATTCTCTAAAACGAGGAGTCCAACAGTTAACTCGGCTTCGACACCCTCGACTTCTTA 
CTGTCCAGCATCCTTTAGAAGAATCCAGGGATTGCTTGGCATTTTGTACAGAACCAGTTT 
TTGCCAGTTTAGCCAATGTTCTTGGTAACTGGGAAAATCTACCTTCCCCTATATCTCCAG 
ACATTAAGGATTATAAACTTTATGATGTAGAAACCAAATATGGTTTGCTTCAGGTTTCTG 
AAGGATTGTCATTCTTGCATAGCAGTGTGAAAATGGTGCATGGAAATATCACTCCTGAAA
ATATAATTTTGAATAAAAGTGGAGCCTGGAAAATAATGGGTTTTGATTTTTGTGTATCAT
CAACCAATCCTTCTGAACAAGAGCCTAAATTTCCTTGTAAAGAATGGGACCCAAATTTAC 
CTTCATTGTGTCTTCCAAATCCTGAATATTTGGCTCCTGAATACATACTTTCTGTGAGCT 
GTGAAACAGCCAGTGATATGTATTCTTTAGGAACTGTTATGTATGCTGTATTTAATAAAG 
GGAAACCTATATTTGAAGTCAACAAGCAAGATATTTACAAGAGTTTCAGTAGGCAGTTGG
ATCAGTTGAGTCGTTTAGGATCTAGTTCACTTACAAATATACCTGAGGAAGTTCGTGAAC 
ATGTAAAGCTACTGTTAAATGTAACTCCGACTGTAAGACCAGATGCAGATCAAATGACAA 
AGATTCCCTTCTTTGATGATGTTGGTGCAGTAACACTGCAATATTTTGATACCTTATTCC 
AAAGAGATAATCTTCAGAAATCACAGTTTTTCAAAGGACTGCCAAAGGTTCTACCAAAAC 
TGCCCAAGCGTGTCATTGTGCAGAGAATTTTGCCTTGTTTGACTTCAGAATTTGTAAACC
CTGACATGGTACCTTTTGTTTTGCCCAATGTTCTACTTATTGCTGAGGAATGCACCAAAG 
AAGAATATGTCAAATTAATTCTTCCTGAACTTGGCCCTGTGTTTAAGCAGCAGGAGGCAA 
TCCAGATTTTGTTAATTTTCCTACAAAAAATGGATTTGCTACTAACCAAAACCCCTCCTG 
ATGAGATAAAGAACAGTGTTCTACCCATGGTTTACAGAGCACTAGAAGCTGCTTCCATTC 
AGATCCAGGAGCTCTGTCTAAACATCATTCCAACCTTTGCAAATCTTATAGACTACCCAT 
CCATGAAAAACGCTTTGATACCAAGAATTAAAAATGCTTGCTACAAACATCTTCCCTTGC 
GGTTCGTGTAAATTCATTAAACAACATTGGAGCAGACCTTGTGACTGGCAGTGAGTGCG 

SEQ ID NO: 88_AA278842_H 

GGCGCGCCAGATATCACACGTGCCAAGGGGCTGGCTCAGCCCCGGCTCGGGCGGCCGGAG 
GACCCGGAGCTAAGGCGCCCGAACCCGCGGCGGCGGTGGGGACGATGTGGTTCTTTGCCC 
GGGACCCGGTCCGGGACTTTCCGTTCGAGCTCATCCCGGAGCCCCCAGAGGGCGGCCTGC 
CCGGGCCCTGGGCCCTGCACCGCGGCCGCAAGAAGGCCACAGGCAGCCCCGTGTCCATCT 
TCGTCTATGATGTGAAGCCTGGCGCGGAAGAGCAGACCCAGGTGGCCAAAGCTGCCTTCA 
AGCGCTTCAAAACTCTACGGCACCCCAACATCCTGGCTTACATCGATGGACTGGAGACAG
AAAAATGCCTCCACGTCGTGACAGAGGCTGTGACCCCGTTGGGAATATACCTCAAGGCGA 
GAGTGGAGGCTGGTGGCCTGAAGGAGCTGGAGATCTCCTGGGGGCTACACCAGATCGTGA 
AAGCCCTCAGCTTCCTGGTCAACGACTGCAGCCTCATCCACAACAATGTCTGCATGGCCG 
CCGTGTTCGTGGACCGAGCTGGCGAGTGGAAGCTTGGGGGCCTGGACTACATGTATTCGG 
CCCAGGGCAACGGTGGGGGACCTCCCCGCAAGGGGATCCCCGAGCTTGAGCAGTATGACC 
CCCCGGAGTTGGCTGACAGCAGTGGCAGAGTGGTCAGAGAGAAGTGGTCAGCAGACATGT 
GGCGCTTGGGCTGCCTCATTTGGGAAGTCTTCAATGGGCCCCTACCTCGGGCAGCAGCCC 
TACGCAACCCTGGGAAGATCCCCAAAACGCTGGTGCCCCATTACTGTGAGCTGGTGGGAG 
CAAACCCCAAGGTGCGTCCCAACCCAGCCCGCTTCCTGCAGAACTGCCGGGCACCTGGTG 
GCTTCATGAGCAACCGCTTTGTAGAAACCAACCTCTTCCTGGAGGAGATTCAGATCAAAG 
AGCCAGCCGAGAAGCAAAAATTCTTCCAGGAGCTGAGCAAGAGCCTGGACGCATTCCCTG 
AGGATTTCTGTCGGCACAAGGTGCTGCCCCAGCTGCTGACCGCCTTCGAGTTCGGCAATG 
CTGGGGCCGTTGTCCTCACGCCCCTCTTCAAGGTGGGCAAGTTCCTGAGCGCTGAGGAGT 
ATCAGCAGAAGATCATCCCTGTGGTGGTCAAGATGTTCTCATCCACTGACCGGGCCATGC 
GCATCCGCCTCCTGCAGCAGATGGAGCAGTTCATCCAGTACCTTGACGAGCCAACAGTCA 
ACACCCAGATCTTCCCCCACGTCGTACATGGCTTCCTGGACACCAACCCTGCCATCCGGG 
AGCAGACGGTCAAGTCCATGCTGCTCCTGGCCCCAAAGCTGAACGAGGCCAACCTCAATG 
TGGAGCTGATGAAGCACTTTGCACGGCTACAGGCCAAGGATGAACAGGGCCCCATCCGCT 
GCAACACCACAGTCTGCCTGGGCAAAATCGGCTCCTACCTCAGTGCTAGCACCAGACACA 
GGGTCCTTACCTCTGCCTTCAGCCGAGCCACTAGGGACCCGTTTGCACCGTCCCGGGTTG 
CGGGTGTCCTGGGCTTTGCTGCCACCCACAACCTCTACTCAATGAACGACTGTGCCCAGA 
AGATCCTGCCTGTGCTCTGCGGTCTCACTGTAGATCCTGAGAAATCCGTGCGAGACCAGG 
CCTTCAAGGCCATTCGGAGCTTCCTGTCCAAATTGGAGTCTGTGTCGGAGGACCCGACCC 
AGCTGGAGGAAGTGGAGAAGGATGTCCATGCAGCCTCCAGCCCTGGCATGGGAGGAGCCG 
CAGCTAGCTGGGCAGGCTGGGCCGTGACCGGGGTCTCCTCACTCACCTCCAAGCTGATCC 
GTTCGCACCCAACCACTGCCCCAACAGAAACCAACATTCCCCAAAGACCGAGGCCTGAAG 
GAGTTCCTGCCCCAGCCCCCACCCCTGTTCCTGCCACCCCTACAACCTCAGGCCACTGGG 
AGACGCAGGAGGAGGACAAGGACACAGCAGAGGACAGCAGCACTGCTGACAGATGGGACG 
ACGAAGACTGGGGCAGCCTGGAGCAGGAGGCCGAGTCTGTGCTGGCCCAGCAGGACGACT 
GGAGCACCGGGGGCCAAGTGAGCCGTGCTAGTCAGGTCAGCAACTCCGACCACAAATCCT 
CCAAATCGCCAGAGTCCGACTGGAGCAGCTGGGAAGCTijAGGGCTCCTGGGAACAGGGT'I'- 
GGCAGGAGCCAAGCTCCCAGGAGCCAGCTCCTfjACC^GTACACGGGTGGCCAGCGAGTATA 
ACTGGGGTGGCCCAGAGTCCAGCGACAAGGGCGACCCCTTCGCTACCCTGTCTGCACGTC 
CCAGCACCCAGCCGAGGCCAGACTCTTGGGGTGAGGACAACTGGGAGGGCCTCGAGACTG 
ACAGTCGACAGGTCAAGGCTGAGCTGGCCGGGAAGAAGCGCGAGGAGCGGCGGGGGGAGA
TGGAGGCCAAACGCGCCGAGAGGAAGGTGGCCAAGGGCCCCATGAAGCTGGGAGCCCGGA 
AGCTGGACTGAACCGTGGCGGTGGCCCTTCCCGGCTGCGGAGAGCCCGCCCCACAGATGT 
ATTTATTGTACAAACCATGTGAGCCCGGCCGGCCCAGCCAGGCCATCTCACGTGTACATA 
ATCAGAGCCAGAATAAATTGTATTTCAC 

SEQ ID NO: 89_AA599286_H

ATGGCCTTCATGGAGAAGCCGCCAGCCGGCAAGGTGCTGCTGGACGACACGGTGCCGCTG 

CAAGGAGGAATTTCTGTGGAAAAGAGCTGGCAGATTGTTAGAAGATACAGTGACTTTGAT 
TTGCTTAACAACAGCTTACAGATTGCAGGCCTAAGTCTACCTCTTCCTCCCAAAAAATTG 
ATTGGTAACATGGATCGTGAATTCATAGCTGAAAGGCAGAAAGGTCTTCAGAACTATCTG 
AACGTGATCACAACAAATCATATCTTGTCTAATTGTGAGGTGGTTAAGAAGTTTTTAGAT 
CCAAACAACTATTCCGCAAACTATACTGAGATTGCCTTGCAACAGGTTTCCATGTTCTTC 
CGATCAGAGCCAAAGTGGGAGGTGGTGGAACCTTTGAAAGACATAGGTTGGAGAATAAGG 
AAGAAATATTTCTTGATGAAGATTAAAAATCAGCCAAAGGAACGGCTAGTGTTAAGCTGG 
GCTGACCTTGGCCCAGACAAGTATTTGTCAGATAAAGATTTTCAGTGTCTAATCAAACTT 
CTGCCTTCTTGTTTGCACCCTTACATCTATCGGGTTACCTTTGCCACAGCTAATGAATCG 
TCAGCGTTGCTAATTAGGATGTTTAACGAAAAGGGAACATTGAAGGATCTGATCTACAAG 
GCAAAACCAAAAGACCCATTTCTAAAGAAGTACTGCAACCCTAAGAAGATTCAGGGCCTG 
GAACTCCAGCAAATAAAAACATATGGACGGCAAATATTAGAGGTACTGAAGTTTCTTCAT 
GACAAGGGATTCCCTTATGGGCATCTTCACGCCTCCAATGTGATGCTCGATGGGGACACT 
TGCCGGCTGCTGGACCTTGAGAATTCCTTATTGGGCCTGCCTTCCTTCTACCGATCTTAT 
TTTTCACAATTCAGGAAAATCAATACATTGGAAAGTGTGGATGTCCAGTGCTTTGGCCAC 
TTACTGTATGAAATGACTTATGGAGGACCGCCAGACTCGGTGCGTGTGGACTCCTTCCCT 
CCTGCCCCGTCCATGGCTGTGGTGGCCGTGTTGGAGTCTACGCTGTCTTGTGAAGCCTGT 
AAAAATGGCATGCCTACCATGTCCCGGCTCTTACAGATGCCATTATTCAGCGATGTTTTA 
CTAACCAGTTCTGAAAAACCACAGTTTAAGATCCCTACAAAGTTAAAAGAGGCATTGAGA 
ATTGCCAAAGAATGTATAGAGAAGAGACTAATTGAGGAACAGAAACAGATTCACCAGCAT 
CGAAGACTGACAAGAGCTCAGTCCCACCATGGATCTGAGGAGGAAAGAAAAAAAAGAAAG 
ATTTTAGCTCGAAAGAAGTCAAAACGATCTGCTCTTGAAAATAGTGAAGAGCATTCAGCG 
AAGTACAGCAACTCCAATAATTCAGCAGGATCTGGGGCCAGCTCACCTCTCACGTCCCCG 
TCATCGCCAAGTCCACCCTCTACATCAGGGATATCTGCATTACCTCCACCTCCTCCACCT 
CCACCACCACCAGCAGCTCCCTTGCCTCCTGCGAGCACCGAGGCACCTGCCCAGCTCTCG 
TCTCAGGCTGTGAATGGCATGAGCCGAGGGGCCTTGCTGAGCTCCATCCAGAATTTGCAA 
AAAGGAACTTTGAGGAAAGCCAAACCTGTGATCAGAGTGCTCGGAAGATCGGCTGAAGCT 
TCCTGTTTACAGTTGGAGGGAAAAGTTCTTTTTTATTCGTAGTCACCCCTACGCCCCAAC 
TACCCTCTTCCTGGGAAAGTAATTGCTGAGCCAGTACAGCCACAAACAGTACTATTTTGC 
AGATGCTCATGTAAGCAGCTTTTCGAGAGAAATAATTCTTTAAGCAGAATAAAGTTAGGC 
TGGCATGCAAAAAAAAAAAAAAAAAAAAA

SEQ ID NO: 90_AA425725_H

ATGAGCGCCAGCACGGGCGGTGGTGGGGACAGCGGCGGCAGCGGCGGCAGTAGCAGCAGC 
TCACAGGCCTCCTGCGGGCCCGAGTCCTCGGGCTCCGAACTAGCCCTGGCCACACCGGTG 
CCTCAGATGCTGCAGGGCCTTCTGGGCTCCGACGACGAGGAACAGGAAGACCCCAAAGAC 
TACTGCAAGGGCGGCTACCACCCTGTGAAGATCGGCGACGTGTTCAATGGGCGGTACCAC 
GTGGTGCGCAAACTGGGCTGGGGCCACTTCTCCACCGTCTGGCTCTGCTGGGACATCCAG 
CGCAAGCGCTTTGTGGCCCTCAAAGTGGTGAAGAGTGCGGGGCATTACACGGAGACAGCT 
GTGGATGAGATCAAGCTCCTGAAATGTGTCCGGGACAGCGACCCCAGTGACCCCAAAAGA 
GAGACCATTGTCCAGCTCATTGATGACTTCAGGATCTCAGGAGTGAATGGAGTCCATGTG 
TGCATGGTGCTGGAGGTGCTGGGCCACCAGCTCCTCAAATGGATCATCAAGTCCAACTAC
CAGGGCCTGCCCGTGCCCTGCGTGAAGAGCATCGTGAGGCAGGTGCTGCACGGCCTGGAC 
TACCTCCACACCAAGTGCAAGATCATCCACACGGACATCAAGCCCGAGAACATCTTGGTG
TGTGTGGGGGACGCTTACATCAGGCGCCTGGCTGCCGAGGCCACGGAGTGGCAACAGGCA
GGGGCGCCGCCCCCCTCCCGCTCCATAGTCAGCACTGCCCCCCAGGAGGTCTTGACCGGT 
AAGCTGTCCAAAAACAAGAGGAAGAAGATGAGGCGCAAACGGAAACAGCAGAAGCGGCTG 
CTGGAGGAGCGGCTGCGGGACCTGCAGAGGCTGGAGGCCATGGAGGCTGCCACCCAGGCT 
GAGGACTCTGGCTTGAGACTAGACGGGGGCAGCGGCTCCACATCCTCTTCAGGCTTCTCC 
GGCTCCCTCTTCTCTCCTGCCTCGTGCTCCATCCTCTCCGGCTCGTCCAATCAGCGAGAG 
ACCGGGGGCCTCGTGTCGCCTAGCACACCATTCGGTGCCTCGAACCTCCTGGTGAACCCC 
CTGGAGCCCCAAAATGCAGATAAGATCAAGATCAAGATCGCAGACCTGGGCAACGCCTGC 
TGGGTGCACAAGCACTTCACGGAAGACATCCAGACTCGGCAGTACCGGGCCGTCGAGGTG 
CTGATCGGCGCCGAATACGGCCCCCCGGCAGACATCTGGAGCACAGCCTGCATGGCCTTC 
GAGCTGGCCACTGGTGACTACCTGTTCGAGCCGCATTCTGGAGAAGACTACAGTCGTGAT 
GAGGACCACATCGCTCACATAGTGGAGCTTCTGGGGGACATCGCCCCAGCCTTCGCCCTC 
TCAGGCCGCTATTCCCGGGAGTTCTTCAACCGGAGAGGAGAGCTGCGGCACATCCACAAT 
CTCAAGCACTGGGGCCTGTACGAGGTAGTCATGGAAAAGTACGAGTGGCCCCTAGAGCAG 
GCCACACAGTTCAGCGCCTTTCTGCTGCCCATGATGGAGTACATCCCCGAAAAGCGGGCC 
AGTGCCGCTGACTGCCTCCAGCACCCCTGGCTCAACCCCTAG 

SEQ ID NO: 91_SGK022_H 

TCTGGCCCTGTCCCTCCCCACCACCCGCCGCTGTGTCCAGACAGAGAATGTTCTAACGCT 
GGGGGCGGCTGCGGATGAAGTCCTTGGGGAGAAAAGGAGCAGGCCAAGGGCGATGGTGGA 
GTAGAGCTGCCTCTCAGAGGCAGCATGAGCTGAGAGGGTGATAGGAAGGCGGCGCTAGAC 
AGCATGGAGGACTTTCTGCTCTCCAATGGGTACCAGGTGGGCAAGACCATTGGGGAAGGG 
ACCTACTCAAAAGTCAAAGAAGCATTTTCCAAAAAACACCAAAGAAAAGTGGCAATTAAA
GTTATAGACAAGATGGGAGGGCCATCAGAGTTTATCCAGAGATTCCTCCCTCGGGAGCTC 
CAAATCGTCCGTACCCTGGACCACAAGAACATCATCCAGGTGTATGAGATGCTGGAGTCT 
GCCGACGGGAAAATCTGCCTGGTGATGGAGGTCGCTGAGGGAGGGGATGTCTTTGACTGC 
GTGCTGAATGGGGGGCCACTGCCTGAAAGCCGGGCCAAGGCCCTCTTCCGTCAGATGGTT 
GAGGCCATCCGCTACTGCCATGGCTGTGGTGTGGCCCACCGGGACCTCAAATGTGAGAAC 
GCCTTGTTGCAGGGCTTCAACCTGAAGCTGACTGACTTTGGCTTTGCCAAGGTGTTGCCC 
AAGTCACACCGGGAGCTGAGCCAGACCTTCTGCGGCAGTACAGCCTATGCTGCCCCCGAG 
GTGCTGCAGGGCATTCCCCACGATAGCAAAAAAGGTGATGTCTGGAGCATGGGTGTGGTC 
CTGTATGTCATGGTCTGTGCCAGCGTACCTTTTGACGACACAGACATCCCCAAGATGCTG 
TGGCAGCAGCAGAAGGGGGTGTCCTTCCCCAGTCATCTGAGCATCTCGGCCGATTGGCAG 
GACCTGCTCAAGAGGCTCGTGGAACCCGATATGATCCTCCGGCCTTCAATTGAAGAAGTT 
AGTTGGCATCCATGGCTAGCAAGCACTTGATAAAAGCAATGGCAAGTGCTCTCCAATAAA
GTAGGGGGAGAAAGCAAA

SEQ ID NO: 92_AA0G0026_M SGK022_M

CAGACGGAGAATGTTCTAGCCCTGGAGGCAGCTGTGAATGAAGTCCTTGGGGGGAAAAGA 
AGCAGGCCGAGGGCGATGGTGGAGTAGAGCTGCCTCGCAGAGGCAGCATGAGCTGAGAGG 
GTGACAAGAAGGAGGCGCTACACAGCATGGAGGACTTTCTACTCTCCAATGGGTATCAGC
TGGGCAAGACCATTGGGGAAGGGACCTACTCAAAAGTCAAAGAAGCATTTTCCAAAAAAC
ATCAAAGAAAAGTGGCAATTAAAATTATAGACAAGATGGGAGGGCCAGAAGAGTTTATCC 
AGAGATTCCTGCCTCGTGAGCTCCAGATTGTCCGTACCCTGGACCACAAAAACATCATCC
AGGTGTATGAGATGCTGGAGTCAGGAGATGGAAAAATCTACCTGGTGATGGAACTGGCTG
AGGGAGGGGATGTCTTTGACTGTGTGCTGAACGGAGGGCCACTTCCCGAGAGCCGGGCCA 
AGGCCCTCTTCCGCCAGATGGTTGAGGCTATTCGCTATTGCCATGGCTGTGGCGTGGCCC 
ACCGGGACCTTAAGTGTGAGAACGCCTTGTTGCAGGGCTTCAACCTGAAGCTGACCGACT
TTGGCTTTGCCAAGGTGCTACCCAAGTCACGCAGGGAGCTGAGCCAGACCTTCTGTGGCA 
GCACAGCCTATGCCGCCCCTGAGGTGCTACAGGGCATACCCCATGATAGCAAGAAAGGTG 
ATGTCTGGAGCATGGGTGTGGTCCTGTATGTAATGCTCTGTGCAAGTCTACCTTTTGATG 
ACACAGATATCCCCAAGATGCTGTGGCAGCAGCAGAAGGGGGTGTCCTTCCCCACTCATT 
TGGGCATCTCAACCGAATGCCAGGACCTGCTCAAGCGGCTCCTGGAACCAGACATGATAC 
TCCGGCCTTCAATCGAAGAAGTTAGTTGGCACCCATGGCTAGCAAGCACTTGATAAAAGC
AATGGCAAGTCCTCCCCAATAAAGTAGGGGGAGAAAGCAAACTG 

SEQ ID NO: 93_AA399669_H

CTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCGCCCGGCCGCACTTCATTCTCAA 

GTTTTGTGGCCAACGATGGATAGGAGGTGGATTGTGATGTATTCGGAACATGGGACCTTG 

AGGAGTTCCGTAACCAAAAGGAGAAAGTAACAACAGCCAGTGGAGACAAAAAGAACTGCT 

TCTCTTTCTTTCCCCCTCCAAGTTCCTAGTGGAGGGCTGAGTCCAGGATCCCAGACTCGT 

GTGACTATATAGGCAAGCATTTGGGGACCTACTTCACTTTGATACCCTAGCCTTCAGCAG 

CTCAAGGTGTTGGCCTTTGGATAGGAGGCTTCCAAGTAGTAAAGCTCCCTGCTCTGAGCA 

AGCCCAACACCATGGGGAAGGGAGATGTCTTAGAGGCAGCACCAACCACCACAGCCTAGC 

ATTCCCTCATGGATGAATATGGTTATGAGGTGGGCAAGGCCATTGGCCATGGCTCCTATG 

GGTCGGTATATGAGGCTTTCTACACAAAGCAGAAGGTTATGGTGGCAGTCAAGATCATCT

CAAAGAAGAAGGCCTCTGATGACTATCTTAACAAGTTCCTGCCCCGTGAAATACAGGTAA 

TGAAAGTCTTGCGGCACAAGTACCTCATCAACTTCTATCGGGCCATTGAGAGCACATCTC 

GAGTATACATCATTCTGGAACTGGCTCAGGGTGGTGATGTCCTTGAATGGATCCAGCGCT 

ACGGGGCCTGCTCTGAGCCCCTTGCTGGCAAGTGGTTCTCCCAGCTGACCCTGGGCATTG 

CCTACCTGCACAGCAAGAGCATCGTGCACCGGGACTTAAAGTTGGAGAACCTGTTGCTGG 

ACAAGTGGGAGAATGTGAAGATATCAGACTTTGGCTTTGCCAAGATGGTGCCTTCTAACC 

AGCCTGTGGGTTGTAGCCCTKCTTACCGCCAAGTGAACTGCTTTTCCCACCTCAGCCAGA 

CTTACTGTGGCAGCTTTGCTTACGCTTGCCCAGAGATCTTACGAGGCTTGCCCTACAACC 

CTTTCCTGTCTGACACCTGGAGCATGGGCGTCATCCTTTACACTCTAGTGGTCGCCCATC 

TGCCCTTTGATGACACCAATCTCAAAAAGCTGCTAAGAGAGACTCAGAAGGAGGTCACTT 

TCCCAGCTAACCATACCATCTCCCAGGAGTGCAAGGTCCAACTGCTCATTGCCTGTGTGG 

CACAATGGAGAAAAACTCAGGCAAGACCTCTCTCTCCCCTGCTCTAGAACCTGATCCTCC 

AGATGCTACGCCAAGCCACTAAGCGTGCCACCATTCTGGACATCATCAAGGATTCCTGGG 

TGCTCAAGTTCCAGCCTGAGCAACCCACCCATGAGATCAGGCTGCTTGAGGCCATGTGCC 

AGCTCCACAACACCACTAAACAGCACCAATCCTTGCAAATTACGACCTGAAAATGGCTGA 

GGGAGGGGGCTAAGAGAGGAGCAAAGCAGGAGGTCTTGGGCTAAAAATCTTTTTTACCAA 

AAATAAATCTAAGTCTGATTTAGTTTCATCAAAAAAA 

SEQ ID NO: 94_AA758539_H

GACCATTCAGACGCCTCCGGTAGTGTAAATGAGGACAATGCCTGCTGGCCCACATGACGG 
GGGGATGTAGACGGCAGCGGCGCCAGTCGCTCCTGGCACCATGGACGATGCCACAGTCCT 
AAGGAAGAAGGGTTACATCGTAGGCATCAATCTTGGCAAGGGTTCCTACGCAAAAGTCAA 
ATCTGCCTACTCTGAGCGCCTCAAGTTCAATGTGGCTGTCAAGATCATCGACCGCAGGAA 
AACACCTACTGACTTTGTGGAGAGATTCCTTCCTCGGGAGATGGACATCCTGGCAACTGT 
CAACCACGGCTCCATCATCAAGACTTACGAGATCTTTGAGACCTCTGACGGACGGATCTA 
CATCATCATGGAGCTTGGCGTCCAGGGCGACCTCCTCGAGTTCATCAAGTGCCAGGGAGC 
CCTGCATGAGGACGTGGCACGCAAGATGTTCCGACAGCTCTCCTCCGCCGTCAAGTACTG 
CCACGACCTGGACATCGTCCACCGGGACCTCAAGTGCGAGAACCTTCTCCTCGACAAGGA
CTTCAACATCAAGCTGTCTGACTTTGGCTTCTCCAAGCGCTGCCTGCGGGACAGCAATGG 
GCGCATCATCCTCAGCAAGACCTTCTGCGGGTCGGCAGCATATGCAGCCCCCGAGGTGCT 
GCAGAGCATCCCCTACCAGCCCAAGGTGTATGACATCTGGAGCCTGGGCGTGATCCTGTA 
CATCATGGTCTGCGGCTCCATGCCCTATGACGACTCCGACATCAGGAAGATGCTGCGTAT
CCAGAAGGAGCACCGTGTGGACTTCCCGCGCTCCAAGAACCTGACCTGCGAGTGCAAGGA 
CCTCATCTACCGCATGCTGCAGCCCGACGTCAGCCAGCGGCTCCACATCGATGAGATCCT 
CAGCCACTCGTGGCTGCAGCCCCCCAAGCCCAAAGCCACGTCTTCTGCCTCCTTCAAGAG 
GGAGGGGGAGGGCAAGTACCGCGCTGAGTGCAAACTGGACACCAAGACAGGCTTGAGGCC 
CGACCACCGGCCCGACCACAAGCTTGGAGCCAAAACCCAGCACCGGCTGCTGGTGGTGCC 
CGAGAACGAGAACA03ATGGAGGACAGGCTGGCCGAGACCTCCAGGGCCAAAGACCATCA 
CATCTCCGGAGCTGAGGTGGGGAAAGCAAGCAGGTAGCATGACAATGGCGCCGTTGTGTG 
TGGTGGGGGTGGGGGTTGGGGGGCATGGTGCAGTCGGCCTTCACGTAAACTAAGTAGGGA 
GGTAGGATCTGAAGAAGGCACAGGTGCAAGTAAAATTCGTCAATTAAACCACTATTTTGA 
TT 

SEQ ID NO: 95_AA883975_H

ATGTCGGGAGACAAACTTCTGAGCGAAGTCGGTTATAAGCTGGGCCGCACAATTGGAGAG 
GGCAGCTACTCCAAGGTGAAGGTGGCCACATCCAAGAAGTACAAGGGTACCGTGGCCATC 
AAGGTGGTGGACCGGCGGCGAGCGCCCGCGGAGTTCGTCAAGAAGTTCCTGCCGCGAGAG 
CTGTCCATCCTGCGGGGCGTGCGACACGCGCAGATCGTGCAGGTCTTCGAGTTCATCGAG 
GTGTGCAACGGGAAACTGTACATCGTGATGGAAGCGGCCGCCACCGACCTGCTGCAAGCC 
GTGCAGCGCAACGGGCGCATCCCCGGAGTTCAGGCGCGCGACCTCTTTGGGCAGATCGGC 
GGCGCCGTGCGCTACCTGCACGATCATCACCTGGTGCACCGCGACCTCAAGTGCGAAAAC 
GTGCTGCTGAGGCCGGACGAGCGGCGCGTCAAGGTCACCGACTTCGGCTTCGGGCGCCAG 
GCCCATGGGTACCCAGAGCTGAGCACCACCTAGTGGGGGTCAGCCGCCTACGCGTCACGG 
GAGGTGGTGCTGGGCATCCCCTACGACCCCAAGAAGTACGATGTGTGGAGCATGGGCGTG 
GTGCTCTACGTCATGGTGACCGGGTGCATGCCCTTCGACGACTGGGACATCGCCGGCCTG 
CCCCGGCGCCAGAAACGCGGCGTGGTCTATCCCGAAGGCCTCGAGCTGTCCGAGCGGTGC 
AAGGCCCTGATCGCCGAGCTGCTGCAGTTCAGCCCGTCCGCCAGGCCCTCCGCGGGCCAG 
GTAGCGCGCAACTGCTGGCTGCGCGCCGGGGACTCCGGCTAG 

SEQ ID NO: 96_AA905446_H

CTGGTAGAGAACAGGGGCTGGTGGCAAGGCGCATGGAGATGAGAAAACGGAAGACAGGGA 
TCATGGAAAGAATTGTGGGGTCAGGGGACAGTGGCGGGAGGAGCTGGCTCACCACCCTGT 
GGACAAATCAGGCCTTATAATTTGTGATTCTGTGGCTTTGTCTAAAAGTCCATAAAGCAC 
GTTGATATCCAGTGTCACAGACTGCTCACAAGAGTCCACAAGGCTGGTGGGGAGTGCTTC 
TTTTGAATGATATACTAACGACAAAAATAATAGAAGTGAACATTGTTTGCAATGTGCAAG 
CAGCTAGACACACTTAAGACCATTAAGAAAGCGAAGAAATAAGACCGAGACAAGGTGGGC
AGAAGTTGGAAGGCAGGAGACAGGTGTGAGGAGGTGGGCCTTTCTGATCTGCCAGCCCAT 
CTCTCCTCCCCTTACTTCCTCAGAGTTTATCCAGAGATTCCTCCCTCGGGAGCTCCAAAT 
CGTCCGTACCCTGGACCACAAGAACATCATCCAGGTGTATGAGATGCTGGAGTCTGCCGA 
CGGGAAAATCTGCCTGGTGATGGAGCTCGCTGAGGGAGGGGATGTCTTTGACTGCGTGCT 
GAATGGGGGGCCACTGCCTGAAAGCCGGGCCAAGGCCCTCTTCCGTCAGATGGTTGAGGC 
CATCCGCTACTGCCATGGCTGTGGTGTGGCCCACCGGGACCTCAAATGTGAGAACGCCTT 
GTTGCAGGGCTTCAACCTGAAGCTGACTGACTTTGGCTTTGCCAAGGTGTTGCCCAAGTC 
ACACCGGGAGCTGAGCCAGACCTTCTGCGGCAGTACAGCCTATGCTGCCCCCGAGGTGCT 
GCAGGGCATTCCCNNCAAGATGCTGTGGCAGCAGCAGAAGGGGGTGTCCTTCCCCACTCA 
TCTGAGCATCTCGGCCGATTGCCAGGACCTGCTCAAGAGGCTCCTGGAACCCGATATGAT 
CCTCCGGCCTTCAATTGAAGAAGTTAGTTGGCATCCATGGCTAGCAAGCACTTGATAAA?=. 
GCAATGGCAAGTGCTCTCCAATAAAGTAGGGGGAGAAAGCAAACCC

SEQ ID NO: 97_H29974_H

TTACAGCCTGTTGGCGGAGATCGGGCGCGGCAGCTACGGCGTGGTTTATGAGGCAGTGGC 
CGGGCGCAGCGGGGCCCGGGTGGCGGTCAAGAAGATCCGCTGCGACGCCCCCGAGAACGT 
GGAGCTGGCGCTGGCTGAATTCTGGGCCCTCACCAGCCTCAAGCGGCGCCACCAuAACGT 
CGTGCAGTTTGAGGAGTGCGTCCTGCAGCGCAATGGGTTAGCCCAGCGCATGAGTCACGG 
CAACAAGAGCTCGCAGCTTTACCTGCGCCTGGTGGAGACCTCGCTGAAAGGAGAAAGGAT 
CCTGGGTTATGCTGAGGAGCCCTGCTATCTCTGGTTTGTCATGGAGTTCTGTGAAGGTGG 
AGACCTGAATCAGTATGTCCTGTCCCGGAGGCCAGACCCAGCCACCAACAAAAGTTTCAT 
GCTACAGCTGACGAGCGCCATTGCCTTCCTGCACAAAAACCATATTGTGCACAGGGACCT 
GAAGCCAGACAACATCCTCATCACAGAGCGGTCTGGCACCCCCATCCTCAAAGTGGCCGA 
CTTTGGACTAAGCAAGGTCTGTGCTGGGCTGGCACCCCGAGGCAAAGAGGGCAATCAAGA 
CAACAAAAATGTGAATGTGAATAAGTACTGGCTGTCCTCAGCCTGCGGTTCGGACTTCTA 
CATGGCTCCTGAAGTCTGGGAGGGACACTACACAGCCAAGGCGGACATCTTTGCCCTGGG 
CATTATCATCTGGGCAATGATAGAAAGAATCACTTTTATTGACTCTGAGACCAAGAAGGA 
GCTCCTGGGGACCTACATTAAACAGGGGACTGAGATCGTCCCTGTTGGTGAGGCGCTGCT 
AGAAAACCCAAAGATGGAGTTGCACATCCCCCAAAAACGCAGGACTTCCATGTCTGAGGG 
GATCAAGCAGCTCTTGAAAGATATGTTAGCTGCTAACCCACAGGACCGGCCTGATGCCTT 
TGAACTTGAAACCAGAATGGACCAGGTCACATGTGCTGCTTAAAATTCAGGGCTAAGCAT 
TTTGGGTGATTTTAAACTAGGTCGATTCCTCGGGACCCACAGTCTCACCACGTCTCCTCC 
AGAGGACGGCAGAGGGTACAGGTGGTGGCCTGGCCGGTTGGCGATCTCCCGACAGCTGGA 
TCCGGCAATGTGAAGCTTTTGTTTGGGTTTCCCCGCTTCTTTTTAGTTTTGCTTTATTTN 
TNNCCTTTTCTTTTCTTTTTTTNTTTNCCACNTNCCTTTTTTTAAATTTAAACCATTGAG 
ACTTCAGAAGAGCAGGACACAATGCTGTGGACAGGCACCAATTTCTTTAAAGAAATTCAA 
TGTGGGCAAGGCATATGTGTAAATTTCACTTTTACTTTTTATAAGGGGTTAGGGAGCTAT 
TTTTGGTTTTGTCCTTCACTTTCCCTCTGTCTTCCTTCTTTATACTTTTCTCAGTTCTAC 
TTATGACACCTCACTTCCCTAGAGAAGGCCTGCCTCCCCATAGGGAATCTGGGGGTANCT 
TCTGGAACGGGGCGTGAGGANACAAGGAGCCTCTGGGCCACNCCTCCCTACCAGATGCAG 
GAACTCCTGGACTCCTTGGTGGGCTGGCCCTGGCTAGCCCTTGGGCCTCGGAGATGATCA 

GAGGTGAAGAACCGCC 

SEQ ID NO: 98_AA498104_M H29974_M 

CCGTTGCTGCTCCCCCCGCCCCCCGCAGCCATGGAAACGGGGAAAGAGAACGGAGCCCGC 
AGAGGGACAAAAAGCCCGGAGCGGAAAAGGCGAAGCCCAGTCCAGCGGGTACTGTGCGAG 
AAGCTGAGGCCGGCGGCCCAGGCCATGGATCCGGCTGGGGCCGAGGTCCCGGGCGAGGCC 
TTCCTGGCCCGGCGGCGGCCGGATGGCGGCGGCGGGGATGTTCCTGCACGGCCGCGCTAC 
AGCCTCTTGGCGGAGATCGGGCGCGGCAGCTACGGCGTGGTTTATGAGGCTGTGGCTGGG 
CGCAGTGGGGCCAGGGTGGCAGTCAAGAAGATCCGCTGCGACGCTCCCGAGAACGTGGAG 
TTGGCACTAGCAGAATTCTGGGCCCTCACCAGTCTCAAGCGGCGGGACCAGAATATCGTG 
CAGTTTGAGGAGTGCGTCCTACAGCGCAACGGGTTAGCCCAGCGCATGAGTCACGGCAAC 
AAGAACTCACAGCTTTACCTGCGCCTGGTGGAGACCTCGCTCAAAGGAGAAAGGATCCTG 

GGCTATGCTGAGGAGCCCTGCTATCTCTGGTTTGTCATGGAGTACTGTGAAGGTGGAGAC 
CTCAATCAGTATGTCCTGTCCCGGAGACCTGACCCAGCCACCAACAAAAGTTTCATGCTA 
CAGCTTACAAGCGCCATTGCCTTCCTGCATAAAAACCACATCGTGCACAGGGACCTAAAG 
CCAGACAACATCCTGATCACAGAGCGGTCTGGCACCCCCATCCTCAAGGTGGCAGACTTT 
GGACTGAGCAAGGTCTGTGCAGGGCTGGCACCCCGAGGCAAAGAGGGCAATCAAGATAAC 
AAAAATGTGAATGTGAATAAATACTGGCTGTCCTCAGCTTGTGGCTCAGACTTCTACATG 
GCTCCCGAAGTCTGGGAGGGACACTATACAGCCAAGGCGGACATCTTTGCTCTGGGCATT 
ATCATCTGGGCAATGATAGAAAGAATTACCTTTATTGACTCTGAAACCAAGAAGGAGCT " 

CTGGGGACCTACATTAAGCAAGGGACTGAGATCGTCCCTGTTGGTGAGGCGCTGCTAGAA 
AACCCAAAGATGGAGTTGCATATCCCCCAGAAACGTAGGACTTCCATGTCTGA 
AAGCAGCTCTTGAAAGACATGTTAGCTGCTAACCCACAGGACCGACCTGATGCTTTTGAA
CTTGAAACCCGAATGGACCAGGTCACATGTGCTGCTTAAACTCCAGGGCTGAACGTCTTG 
GGTGTTTTTAAACTAGGTCGATCCTTCGGGACCCACAGTCTCATCGTGTCTCGGACAGGA 
TGGCAGAGGGTACAGGTGGTGGTGATCTCCTGACAGCTGGACCTCCCACAATGTGAAGCT 
CACGCTTGGGCTGCCCACTCTACCCTTCTCTTTCTCCTTCAGTAGAATAATAATTGTTTT 
TCTAAACATTAAACCATCAAGACTTCTGAAGAGCAGAAGGCTACACTCTG 



SEQ ID NO: 99_AA215311_H

CGRCCGCGCTACGGAAAGCCGGAGGGGGGCGGGGCCGTCGGCGTAAGGGGGTGTGTCCGC 

GCGCACCAGGGGGGGGGGCGGCGGGTGCTGAGTGGAGGGGGGGGCAGCGGAGGGGCGAGC 

TGCCCGATAATGGCGGCCTGCAGAGCCCATGAGAGGGAGAAGCGGCAGCGTCTACCCTGA 

GAAACCTCGACCTTGAAGATGGTGAGTAGCCAGCCAAAGTACGATCTAATACGGGAGGTA 

GGCCGAGGTAGTTACGGTGTTGTGTATGAAGCAGTCATCAGAAAGACCTCTGCACGGGTG 

GCAGTGAAGAAAATTCGATGTCACGCACCTGAAAATGTTGAACTAGCCCTTCGTGAGTTC 

TGGGCACTAAGCAGTATCAAGAGCCAACATCCAAATGTGATTCACTTGGAGGAATGCATC 

CTACAAAAGGATGGGATGGTGCAAAAGATGTCCCACGGCTCTAATTCTTCGCTTTATTTA 

CAGCTTGTAGAAACTTCATTAAAAGGAGAAATTGCCTTTGATCCCAGAAGCGCCTATTAT 

TTGTGGTTTGTGATGGATTTTTGTGACGGAGGAGATATGAATGAGTATCTGTTGTCCAGG 

AAGCCCAATCGTAAAACTAACACCAGCTTCATGCTTCAGCTGAGCAGTGCCCTGGCTTTC 

TTGCATAAAAACCAGATCATCCACCGAGATCTTAAGCCTGATAACATCCTGATTTCTCAA

ACCAGGTTGGATACCAGTGACTTGGAACCTACCCTCAAAGTGGCTGATTTTGGTCTAAGT 

AAAGTTTGTTCAGCCTCTGGGCAGAACCCAGAAGAACCTGTCAGTGTAAACAAGTGTTTC 

CTTTCCACAGCATGTGGAACAGATTTTTACATGGCTCCTGAAGTTTGGGAAGGACATTAC 

ACAGCAAAAGCTGACATCTTTGCTCTGGGGATTATCATCTGGGCAATGCTGGAAAGGATC 

ACATTCATAGACACAGAGACAAAGAAGGAACTCTTGGGGAGTTATGTAAAACAAGGAACT 

GAGATTGTGCCTGTTGGGGAGGCACTTCTGGAAAATCCCAAAATGGAACTTCTCATTCCT 

GTGAAGAAAAAATCTATGAATGGGCGAATGAAACAACTGATTAAGGAAATGCTGGCTGCA 

AACCCTCAGGATCGTCCAGATGCTTTTGAACTAGAACTCAGATTAGTACAAATTGCATTT 

AAAGATAGCAGCTGGGAAACGTGACACATATTATTTGCAAATACCATGGATGATATGCTG 

CTTCTGTTTAACAGTGATGCAACATTATGTGGCTGAAAAAGAATATAAAAAGCTAGACTC 

TACCCTCTAAGGGTTTAGATTTTTTGTGGGATTTTTTTTTTCCTCATTTTTCTTAAATCC 

AAGTTGGCCGTTTTATTAGTATGTTTCAAATGTGTATTACCAATGTGGGTGTAAATTTTT 

AAAAAATGATTATTGATAGAAGTTTGGCAGGAAAATTCTTTAAGAGCTAACAAGAGAAGA 

GAGTCCAGTTTTCTGGAAATATGTCTTTAAGTATTTTAGACATTCCTCGTCAGTATTAGG 

AATTTCCATGGGAAAAGAGGTTTGCATGCTGGTAATGCAACCTTTGAAACTTTGTAAAGG 

AAACATATATGTATATATTTATGTATATGTAAGTATGTGAATGTGCGCATTTTGCATTCC 

ATATGAAAAAAATGCCACGTCTGTTTAAATTATTTGATGTAGGTTTGGGTTTTTGAGATT 

TGCTGGTGAAGTCAGTGACGAAAAATAAACCTTCCCTTATCTTCCTACTCTGCCCCTCCC 

CCTAATGAAATCATATTAAGTNGTTTTTCCTNNTTTTTTTGTAATATACAGCTTTTTTTT 

TAAGGCATCATTTTCGAGGGTCTAAAATTATCTGGTAAAACAAATGAAATTAAGTGATCC 

AAAGCTGCTGAAGTATGTTTGAACTCTCCAGTGCCCTATAGCTGCAAGAGTTGAATTAGT 

CATGCAGTCATATGGCAGCAGGTTGGTGATT 

SEQ ID NO: 100_AA018361_H

GCGGGGCCTCCGTATCCCCACGTGGGCCCTGCAGGAACTGGCjGGGCGCGTGACCCGG
AGGCCCAGAGACAGGGGA3GGGCGCCGGGAGCCGGGCGGATCCGCGTCCCCGATGCGC
TGCATTTCCGGCGGGCGGIGCTGGGGGCAGZGTGGAGCCACCrAGTGCTCGGCCCGCC
GCAACCCGCCGGAACCGCZGCCCGCAGCGAGGAAGCGCCCGC3CGGGCGCAGGCGGCC
AATGGCGGGGCCCGGCTG3GGTCCCCCGCGCCTGGACGGOTTIATCCTCACCGAGCGC
GGGCAGCGGCACGTACGCIACGGTGTACAAGGCCTACGCCAA3AAGGACACTCGTGAAG7

FIGURE 2YYY

GGTAGCCATAAAGTGTGTAGCCAAGAAAAGTCTGAACAAGGCATCGGTGGAGAACCTCCT 

CACGGAGATTGAGATCCTCAAGGGCATTCGACATCCCCACATTGTGCAGCTGAAAGACTT 

TCAGTGGGACAGTGACAATATCTACCTCATCATGGAGTTTTGGGCAGGGGGCGACCTGTC 

TCGCTTCATCCATACCCGCAGGATTCTGCCTGAGAAGGTGGCGCGTGTCTTCATGCAGCA 

ATTAGCTAGCGCCCTGCAATTCCTGCATGAACGGAATATCTCTCACCTGGATCTGAAGCC 

ACAGAACATTCTACTGAGCTCCTTGGAGAAGCCCCACCTAAAACTGGCAGACTTTGGTTT 

CGCACAACACATGTCCCCGTGGGATGAGAAGCACGTGCTCCGTGGCTCCCCCCTCTACAT 

GGCCGCCGAGATGGTGTGCCAGCGGGAGTATGACGCCCGCGTGGACCTCTGGTCCATGGG 

GGTCATCCTGTATGAAGCCCTCTTCGGGCAGCCCCCCTTTGCCTCCAGGTCGTTCTCGGA 

GCTGGAAGAGAAGATCCGTAGCAACCGGGTCATCGAGGTCCCCTTGCGGCCCCTGCTCTC 

CCGAGACTGCCGGGACCTACTGCAGCGGCTCCTGGAGCGGGACCCCAGCCGTCGCATCTC 

CTTCCAGGACTTCTTTGCGCACCCCTGGGTGGACCTGGAGCACATGCCCAGTGGGGAGAG 

TCTGGGGCGAGCAACCGCCCTGGTGGTGCAGGCTGTGAAGAAAGACCAGGAGGGGGATTC 

AGCAGCCGCGTTATCACTCTACTGCAAGGCTCTGGACTTCTTTGTACCTGCCCTGCACTA 

TGAAGTGGATGCCCAGCGGAAGGAGGCAATTAAGGCAAAGGTGGGGCAGTACGTGTCCCG 

GGCTGAGGAGCTCAAGGCCATCGTCTCCTCTTCCAATCAGGCCCTGCTGAGGCAGGGGAC 

CTCTGCCCGAGAGCTGGTCAGAGAGATGGCCCGGGACAAGGCACGCCTCCTAGCTGCCGT 

GGAAGTGGCTTCAGCTGCGATGGCCAAGGAGGAGGCCGCCGGCGGGGAGCAGGATGCCCT 

GGACCTGTACCAGCACAGCCTGGGGGAGCTACTGCTGTTGCTGCGGAGCCCCCGGGCCGG 

AGGCGGGAGCTGCTTCACACTGAGGTTCAGAACCTCATGGCCCGAGCTGAATACTTGAAG 

GAGCAGATGAGGGAATCTCGCTGGGAAGCTGACACCCTGGACAAAGAGGGACTGTCGGAA 

TCTGTTCGTAGCTCTTGCACCCTTCAGTGACCCTAGAAGAATGATTGGACAGATGTGAGC 

CATCTGGAGCAGAGGGGCACTAACCCAGGCTGACGCCAAGAATGAAGTGGCCCACTGCAG 

CCCTGGCGAGCAGGCTTCTTGGATGGACAGTGCTGAGACCGCCATATCCCAGAGTCCCCA 

GCCTCCCTCAGGTTACTCTGCACCCCACAGATGGTTTGATGGGTGTGCTGTATACTGGAG 

GGGAGGGCAGGACTCTGGGAGAACAGCACTTCTTTCATGAGACCTTTGTTACTCGGTGGT 

TACTGGGTCCTGTGCCTGTCCGTTTTGGGGCATGCAGCCCTCTATCATTTTTGGCTCCGA 

GAAGAGGGCAAGGGGCCCCCGCAGGGTACTTCTGTGCTTGCCCTCGCCCTGCCAGCAGGC 

AGCTGTGCCCGTGGCCTGCCCTTCCCGGGACGCCTTATTCCAACTCAGCTCCTCTTTGCA 

CTGGAATGGGGCACTCCAACACGCGTCAGGGACCACCCTCCCCACAGTATGCACTCAGGC 

CCACAGAACCCACCAGTCTTTCTGGGAACTCACACCTGCCCGCCATCTTGGTACTTTAGG 

TTAATCCCTCAAGCATGAAAGCTGGATCTTTTGGGGTTTAAGAAGCCCAAGCCTTGTTCC 

TGCCCTGGGCTAGGGAGCACTCAGGAGGGTTCCTTGGTCCTCATCTCTCCCACCTCCGTT 

CCCTCTGGGCCCCACAGTAGGCACAGCGCGGGCCTTGTGCTGGAGTTTGAGCCTGGGACA 

GGGAGAGGGAGGCTTGGAGACAGTCTGACCCAGTGCCCTCTAGGGCACCCACTTCTAGGC 

CTGCCCTGCCGCCGTGGAGCCCTGGGCAAGCTGTTTCCGGTTTGTGGGCCTGGGTCTCCC 

CATCTCTTCAATGGGGCTGATACGTTCAGAGCCCACAGCATGGGCAGTTATGAGGACAAA 

GTGAATTTAACCTGGAAAAGAATGTATTTGAGAGTTTCTTTTAAATAATCAGCGGGTGTT 

GGTGATTTGTAGCCCTTCTGCCCTTAAATGCTTCCTTGGGCAAGAGCTGTCTGTCCTCCC 

TGCAGGAGGCTGAGTGTGAAGAGTATCATTCATTGTTTCTCTATTAAATTATTTTCTCT 

SEQ ID NO: 101_AA311714_H

TGGACCTGTCCTGAGGCAGAGGCCGAGATGCGCGCAACCGCGGGAGCAGCCAAGTGGACT 
GGAGTCTTTTCTTGAGTTAGGTACCAGGAGCTAGAGATGCTGTTATTCTATCGTATGTGA 
GAAGTCGGCCCAGAGATGGAAAACTTTATTCTGTATGAGGAGATCGGAAGAGGAAGCAAG 
ACTGTTGTCTATAAAGGGCGACGGAAGGGAACAATCAATTTTGTAGCCATTGTTTGTACT 
GATAAGTGCAGAAGGCCTGAAATAACCAACTGGGTCCGTCTCACCCGTGAAATAAAACAC 
AAGAATATTGTAAGTTTTCATGAATGGTATGAAACAAGCAACCACCTCTGGCTAGTGXAT 
GAA/^ACCTCCCAGAAGATGTTGTGAGAGAATTTGGAATTGACCTGATTAGTGGATTACAT 
CATCTTCATAAACTTGGCATTCTCTTTTGTGACATTTCTCCTAGGAAGATACTCTTGGA/, 
FIGURE 2ZZZ

GGGCCTGGCACACTGAAGTTT/<JjCAACTTT 

GAAGAGTTCTTTGCTTTGGTGGCAGCAGAGGAAGGAGGAGGTGAT7\ATGGGGAA/^.^^3TC 
CT(]AAGAAAAGCATGAAA/iGTAGAGTCAAAGGA.TC11CGTGTATA.TACAGCACCAGAAGTT

GTGAGGGGTGGTGACTTTTCCATCTCCAGTGACCTCTGGTCTTTGGGCTGTCTGCTTTAT 
GAAATGTTTTCAGGAAAACCTCCATTCTTCTCAGAAAGTGTTTCAGAATTAACTGAAAAG 
ATCTTATGTGAAGATCCTTTGCCACCTATTCCGAAAGATTCTTCTCGTCCTAAAGCTTCT
TCAGATTTTATTAATTTGCTTGATGGGTTACTTCAAAGAGATCCTCAGAAAAGATTGACT 
TGGACAAGGCTACTGCAGCATTCATTTTGGAAGAAAGCTTTTGCTGGAGCAGATCAGGAA 
TCAAGCGTCGAAGATCTCAGTCTCAGCAGAAACACTATGGAGTGTTCTGGGCCACAAGAT 
TCCAAGGAGCTTTTGCAGAACTCTCAGAGTAGACAAGCAAAAGGGCACAAGAGTGGTCAA 
CCACTAGGTCACTCTTTCAGACTAGAAAATCCAACTGAGTTTCGGCCTAAGAGTACTCTT 
GAGGGTCAATTGAATGAATCCATGTTTCTTCTCAGTTCTCGTCCTACTCCCAGAACTAGC 
ACTGCAGTGGAAGTAAGTCCTGGTGAGGATATGACTCACTGTTCACCACAGAAGACTTCT 
CCTCTGACCAAGATTACAAGTGGACACCTGAGTCAGCAGGACCTGGAATCCCAGATGAGA 
GAGCTTATCTACACGGACTCAGATCTTGTTGTCACCCCCATTATCGACAATCCAAAGATA 
ATGAAACAGCCACCAGTTAAATTTGATGCAAAAATATTGCATCTACCAACATATTCAGTG 
GATAAGTTATTATTTCTGAAAGATCAAGATTGGAATGACTTTTTGCAACAAGTGTGCTCG 
CAGATCGACTCCACTGAGAAGAGCATGGGGGCCTCCCGAGCCAAGCTGAATCTCCTTTGC 
TATTTGTGCGTGGTGGCTGGTCACCAGGAGGTGGCCACCAGGCTCCTCCATTCCCCCCTG 
TTCCAATTGCTAATCCAGCATTTGCGGATAGCTCCAAACTGGGATATACGGGCCAAGGTT 
GCTCACGTGATTGGTTTACTGGCTTCGCACACAACTGAGCTCCAGGAAAATACACCTGTT
GTTGAGACTACAAGCTCCATTGGAATCGGGATTTTGAACTGTCTTGTTCAACACTCCACT 
CCAGTGCCTAGACAGTGCCTTGTGTATGTATAGATACTGACAAATATTTCAAAATAAATA 
7VAACTGTATCAGCATT 

SEQ ID NO: 102_SGK384_H

TCTTTGGCCCACGTGCTGAGGGCGCGGCAGATCCTGACGGAGCCAGAAGTGCGCGACTAC 
CTGCGGGGCCTGGTCAGCGGCCTGCGCTACCTGCACCAGCGGTGCATCCTGCACCGC 

SEQ ID NO: 103_AA210451_M SGK384_M

GGTCTGCTGCATGGATAATGGACTGGAACACAGAAAGACCATGCAGGGTTCGGCTGTAGA 
AGGCCAGTATCTCCAGAGGCCAGAAGACACCATCAGATCTCCTGGGACTGGAGTTATAGA 
GGTTGTGAGCTGCCATGTTGAACCAAGCAGGTCACTGAGGGACACAGGCATGTGGATGGA 
AACCCTGCTGGGAGAAAAAAGAAACTGCTGAAGGGACTGACATGGGACAGCAACATGGAA 
CCAGGAATGGTCTCACGCATAGAGAGCTCCCCCGGGGCGTGGGGCTGCTGCTCGCCATGG 
CCCTTATGAACGTGGCGCTCTACCTCTGCCTTGATCAGCTTTTCATCTCCCCTGGACGAT 
CCACCGCGGACTCTAGGCGCTGTCCTCCGGGCTACTTCAGAATGGGGCGGATGAGAAACT 
GCTCACGCTGGCTGTCCTGTGAAGAGCTGAGGACAGAAGTCAGGCAGCTGAAGCGCGTTG 
GGGAGGGAGCCGTGAAGAGAGTCTTTCTGTCTGAATGGAAGGAACACAAAGTCGCTCTCT 
CCCGGCTCACCAGGCTGGAGATGAAGGAGGACTTCCTGCATGGGCTGCAGATGCTGAAGT 
CTCTACAGAGTGAGCACGTGGTCACGGTGGTGGGCTACTGTGAGGAAGATGGCACTATTC 
TCACCGAlATATCACCCCTTAGGTTCCTTGAGCAACCTGGAAGAAACACTAAACCTTTCAA 
AGTACCAAGACGTGAACACTTGGCAGCACAGGCTGCAGCTGGCCATGGAGTACGTCAGCA 
TCATTAACTATCTGCATCACAGCCCCCTGGGCACGAGGGTCATGTGTGACTCTAACGACC 
TGCCCA/iAACATTGTCCCAGTACCTGCTAACAAGTAACTTCAGCATTGTGGCAAACGACC 
TGGACGCTCTGCCCCTGGTAGA.CCATt jACTCTGGGGTACTTATAAAGTGTGGCCACAGA ( "t 

AGCTCCATGGGGATTTTGTGGCTCCAGAGCAGCTGTGGCCCTACGGAGAAGACACGCCC'I' 
TCC/vAGACG7\TCTCATGCCTTCCTACAATGAGAAGGTTGACATCTt^GAAGATTCC\AGATL' 
TCTCCAGTTTCCTCTTGGGGCACGTGGAAGGGAGTGATATGGTTAGATTCCATTTGTTTG 
ATATCCATAAGGCGTGCAAGAGCCAGATCCCGGCAGAAlAGAC CCACTGCTCAGAACGTGC 
FIGURE 2AAAA

TAGACGCTTACCAGAGGGTTTTCCATTCACTCCGAGACACTGTGATGTCGCAGACGAAAG 
AAATGCTGTAAAAATGAGCCATCGAGTGACGTGCTTGATGGCTG7VATGGCATCCCAGCTG 
TTCCGCTCTTGATGATGGAAGAGCTTTGCATGGATGGATGTTGACCCTGGCTGTTCAGCC 
ACGTAGGCCTCCTCTACGTCTGCCTGCATGTTTGAGTGTTCTGCTCTCCTGGCAGCCCGG 
ATGGAAGCTGCCAAGCGAGAAAGCCTGGCTTCAGGATGCTCCCTGGTGAAGATGCAGAGG 
ATTCTGGATCTGCATAGTTTCAAGGGAGTGATCAAACGGTGACCTTGAAGACATGCTGCC 
TGCCTTGGTAACTTTTTATAGACTAGTAGGAAACAGAAATCTTTTGGGGGAGGGGGGGAC 
AACCCACTAGTTCCTCAGAGACAATTTCTTCTCATTCAGAAAGCCCTGTTGGAAGCTGGG 
GATGTTTTAACTCCGTGGCAGGGCACTTGCCTAGTTGTGTGCAAAGCCTTGGATCTGACC 
CATGGCATGTGCACACACACAAATGCTCAAAGAAAATCCCAGACGCCAGAAGTGTGCCCC 
TTTCTTGTCAATAAGGTCATTGTCAGTACCGGAGATGATTTTTTTTATGAAGCGTTTATG 
CTGACTCGTGTCACTGAGCCAAGTGTGCATGGTCGTTAGCTACTTTGTGGGTTCTTCTTT 
CTTTCTACCCTACTTCTTCCCTTTCCACCCCCTAACACTAGATAGGAGAGAGGAGAGAGA 
AAGGAAAGTGGGCACTGTTATATTGTTGGACGACTTCTTGCTGATTAAGGGGTGTCGAGT 
TCCTTGGAGCAATGATCTTTGCTGCCAAGATATCTCATTTCTTCTTGTTTCTTCTTCGCC 
CACGACCACTTCACAAACACCGACCAACAGCAAACAACAACCCACCCCGGTTCTCGGGGG 
CCCTAGCACTTATGTACTTCTGAAAAGTCCCCAGAAATTCCAATCATCACACACTCAGAG 
AAACTGTCTGCTGCTGGCAAAACTACACCCCTGCTAGAGCATGAGGCAAATCATAGTCAG 
CTGCTGTGGACAGTCTGAAGCAGCCTGGCATCCCACACCTGAGATTAAAACAAAAACATT 
CTTACCTGTGTTTTGTTTTTGTTTTAAGAAACCAAAGTGCACCAAGATAGCATGCTCTTG 
AGATTGTGGCTGTCTAGAGATTTTTGGAACAGCAAGTTGAAGGAACTTTCTTACCTGCCT 
TGAATGGTGCTTTGAACTTCCTGCTGACCTGGAGTTTCTGTGTGAATATTTCTATCCAGT 
GTCCGCCTGTACCGGAAAGTACAAAGTCTGCTCTGGGCTTGCATGCCTGAACACTTTAAA 
ACACTGTGGAGCCAGGAATAATGGTACCCACCTGTAATCCCAGCACCTGGGAGACAGGAG 
GAACCAGGAGTTCAGGGTTATCCTGGGCTATATACCGTGACCCTGTCTACCCCCACACCC 
CAATAAAAAAACAAAAAGGTC

SEQ ID NO: 104_SGK071_2_H

GAGGTGGTGGCTGTGCAGATGATGGTGGAATGCATGGATGACCATTACGCCAGTCAGGCC 
CTGGAGGAGCTGATGCCACTGCTGAAGCTGCGGCACGCGCACATCTCTGTGTACCAGGAG 
GTGTTCATCACGTGGAATGGGGAGATCTCTTCTCTGTACCTCTGCCTGGTGATGGAGTTC 
AATGAGCTCAGCTTCCAGGAGGTCATTGAGGATAAGAGGAAGGCAAAGAAAATCATTGAC 
TGTGAGTGGATGCAGAATGTGCTGGGCCAGGTGCTGGACGCGCTGGAATACCTGCACCAT 
TTGGACATCATCCACAGGAATCTCAAACCCTCCAACATCATCCTCATCAGGAGTGACCAC 
TGCAAACTGCAGGACCTGAGTTCCAATGTGCTAATGACAGACAAAGCCAAATGGAATATT 
CGTGCGGAGGAAGACCCCTTTCGTAAGTCCTGGATGGCCCCTGAAGCCCTCAACTTCTCC 
TTCAGCCAGAAATCAGACATCTGGTCCCTGGGCTGCATCATTCTGGACATGACCAGCTGC 
TCCTTCATGGATGGCACAGAAGCCATGCATCTGCGGAAGTCCCTCCGCCAGAGCCCAGGC 
AGCCTGAAGGCCGTCCTGAAGACAATGGAGGAGAAGCAGATCCCGGATGTGGAAACCTTC 
AGGAATCTTCTGCCCTTGATGCTCCAGATCGACCCCTCGGATCGAATAACGATAAAGGAC 
GTGGTGCACATCACCTTCTTGAGAGGCTCCTTCAAGTCCTCGTGCGTCTCTCTGACCCTG 
CACCGGCAGATGGTGCCTGCGTCCATCACCGACATGCTGTTAGAAGGCAACGTGGCCA<3C 
ATTTTAGGTGATGCTGGGGACACAAAGGGGGAGCGTGCCCTGAAGCTCCTGTCCATGGCC 
TTGGCATCCTATTGTTTAGTTCCAGAGGGTTCATTATTTATGCCCCTGGCCTTGCTCCAC 
ATGCACGACCAGTGGCTCAGCTGTGACCAGGACAGAGTCCCTGGGAAGAGAGACTTTGCC 
TCCCTGGGGAAACTAGGGAAGCTGTTGGGCCCCATCCCAAAGGGTCTGCCGTGGCCCCCG 
GAGCTGGTGGAGGTGGTGGTCACGACCATGGAGCTACATGACAGGGTCCTCGATGTCCAG 
CTGTGTGCCTGCTCCCTGCTGCTGCACCTCCTGGGCCAAGCGCTGGTGCACCACCCGGAA 
GCCAAGGCTCCCTGCAACCAAGCCATCACCTCCACCCTGCTGAGTGCTCTTCAGAGCCAC 
CCCGAGGAGGAGCCACTTCTTGTCATGGTCTACAGCCTGCTAGCCATCACCACAACCCAG 
GAGTCAGAGTCACTGTCAGAGGAGCTGCAGAACGCTGGGCTGCTGGAGCACATCCTGGAG
CACGTCAACAGGTCCCTCGAAAGCAGGGACGTCTGCGCCAGCGGCCTGGGCCTGCTCTGG 
GCCCTCCTGCTGGACGACCCCATCTTGGCACTCCAGCGCCGCAGGAAAAAGAGAGCTCCA 
AACCACGGAAAGCCCGGGAAACCCAAGAACCCTGCCAGCACCCAAAGTATCATTGTGAAC 
AAGGCCCCCTTGGAGAAGGTCCCGGACCTCATCAGCCAGGTGTTGGCCACCTACCCTGCG 
GATGGGGAAATGGCAGAAGCCAGCTGCGGAGTCTTCTGGCTGCTGTCCCTGCTGGGCTGC 
ATCAAGGAGCAGCAGTTTGAACAAGTGGTGGCGCTGCTCCTGCAAAGCATCCGGCTGTGC 
CAGGACAGAGCCCTGCTGGTGAACAATGCCTACCGGGGACTGGCCAGCCTGGTGAAGGTG 
TCAGAGCTGGCGGCCTTCAAGGTGGTGGTGCAGGAGGAGGGCGGCAGTGGCCTCAGCCTC 
ATCAAGGAGACCTACCAGCTCCACAGGGACGACCCGGAGGTGGTGGAGAACGTGGGCATG 
CTGCTGGTCCACCTGGCTTCCTATGAGGAGATCCTGCCGGAGCTGGTGTCCAGTAGTATG 
AAGGCCCTGCTCCAGGAGATCAAGGAGCGCTTCACCTCCAGCCTGGTGAGTGACAGCAGC 
GCCTTCAGCAAACCAGGCCTCCCTCCAGGTGGAAGCCCCCAGCTGGGGTGCACCACGTCT 
GGGGGACTGGAATAG 

SEQ ID NO: 105_AA118352_M SGK071_M 

CAGAAGAAGACCCCTGCCAGAAGTCCTGGATGGCTCCTGAAGCTCTCAAATTCTCCTTCT 
CCACCAAATCCGACATCTGGTCTCTGGGCTGCATCATTCTAGACATGGCCACTTGCTCCT 
TCCTGAACGACACAGAAGCCATGCAACTGCGGAAGGCCATCCGCCATCATCCAGGCAGCC 
TGAAGCCCATCCTGAAAACCATGGAGGAGAAGCAAATCCCTGGTACAGATGTCTACTATT 
TGCTTCTGCCCTTCATGTTGCATATCAACCCCTCCGATCGACTGGCAATCAAGGATGTGA 
TGCAAGTCACCTTCATGAGCAACTCCTTCAAAAGCTCCTCTGTTGCGCTGAATATGCAGC 
GGCAGAAGGTCCCCATCTTCATCACTGACGTGCTGCTTGAAGGCAACATGGCCAACATCT 
TAGGCAGCTGGCTGTGTGCTTCCTTTGTGAACGACAGCAGGCACTGTGACTCAGGGATTG 
GCTCGCAGAGACTTGGGTTTGATTTTCAGTCAGTCTCTTGGACAGAGCACCCTCTGAAAG 
ATGTCATGCAGAATTTCTCCAGTCGACCAGAGGTCCAGCTCAGAGCCATTAACAAGTTGT 
TGACAATGCCAGAGGACCAGCTAGGGCTGCCATGGCCCACAGAGCTGCTGGAAGAGGTGA 
TCAGCATCATAAAGCAGCATGGGCGGATCCTGGATATTCTGCTCAGCACCTGCTCCCTTC 
TGCTGCGTGTTCTTGGCCAAGCACTGGCAAAGGACCCAGAAGCTGAGATCCCAAGGAGCA 
GTTTGATCATCTCCTTCCTGATGGATACCTTGCGGAGCCATCCTAACTCTGAAAGGCTTG 
TTAATGTGGTCTACAACGTGCTTGCCATTATTTCCAGCCAAGGACAGATCTCAGAAGAGC 
TGGAAGAGGAGGGGTTGTTTCAGCTTGCCCAAGAGAACCTGGAGCACTTCCAAGAGGACA 
GGGACATCTGCCTCTCTATCCTGAGCCTGCTCTGGTCCCTCCTGGTAGATGTTGTCACTG 
TGGACAAAGAGCCCTTGGAGCAGCTCTCTGGCATGGTCACCTGGGTGCTGGCTACTCATC 
CGGAGGACGTGGAAATAGCAGAGGCTGGCTGTGCGGTGCTCTGGCTGCTGTCCTTGTTGG 
GCTGCATAAAGGAGAGTCAGTTTGAGCAGGTGGTAGTGCTGCTCCTGAGAAGCATCCAGC 
TGTGCCCTGGCAGAGTACTGCTGGTGAACAATGCATTCCGTGGCTTGGCCAGCCTCGCAA 
AGGTGTCCGAACTGGTGGCCTTCCGAATAGTAGTACTGGAAGAGGGCAGCAGCGGCCTC*:: 
ACCTCATCCAAGATATCTACAAGCTCTACAAGGATGACCCTGAGGTGGTGGAGAACCTCT 
GCATGCTGTTGGCCCATCTGACCTCCTACAAGGAGATCCTGCCAGAGATGGAGTCTGGAG 
GCATCAAAGACCTAGTCCAGGTGATCCGGGGGCGCTTTACCTCCAGCCTGGAGCTGATTT 
CTTACGCTGATGAGATACTCCAGGTACTGGAAGCAAATGCACAACCTGGCCTCCAGGAGG 
ATCAGCTTGAGCCTCCTGCAGGGCAGGAAGCCCCACTGCAGGGAGAGCCCCTCTTCAGGii: 
CCTGACATGCTGCCCTTCTGGTCCTGTGGTAAGAGAAAGTATCACTAGGTCCAGTATTA/i 
TTTCGTACCCCATGGTGACTAATAAAAGAAGCCCTAGGCTGTTTCTGGC 

SEQ ID NO: 106_018653.9_H

GGCCGGGGTCGGGGCGCGGGGCATGCGCGCGGGCTGGGCAGGGGGCCGGCGGGGCGCAGA 
GCGGAGCCGCCTCGGAGCCTGAGCCGCCCGGGGCCGGGGCCGGGGAGCCGCGCGGGGCCG 
GCCGGCCGGGGGGAGGGGAGCGATGCGGCGCCGGCGGGCGGCAGTGGCCGCGGGTTTCTG
CGCCTCCTTCCTGCTGGGCTCCGTCCTCAACGTGCTCTTCGCTCCGGGTCGGAGCCTCCG

AGGCCAGGCCAGTCCCGTGAGCCTTCGCCGGCCCCGGGTGCGGGGGGTCGGGGGGGCGGC 

GGGGAGCTGGCCCGGCAGATCCGGGCGCGCTACGAGGAGGTGCAGCGCTATTCCCGCGGG 

GGCCCCGGGCCCGGGGCGGGCCGGCCGGAGCGGCGGCGCCTGATGGACCTGGCTCCGGGC 

GGGCCCGGCCTGCCGCGCCCCCGGCCCCCTTGGGCCCGGCCCCTGTCCGACGGCGCCCCA 

GGCTGGCCCCCGGCTCCCGGCCCAGGCTCCCCCGGGGCGGGCCCGGGCCTGGGCTGCGCC 

GCGCTTGGCAACGTGTCCGGCGCGCAGTACATGGGGTCAGGCTACAGCAAGGCCGTGTAC 

CGGGTCCGCCTGGGGGGCGGTGGGGCGGTGGGGCTGAAGGGGGTGGAGTTTAGCGGCCAC 

GATCTGGGCAGCTGCGTGCGCGAGTTCGGGGTACGGAGGGGCTGCTATCGGCTGGCGGCC 

CACAAGCTGGTTAAGGAGATGGTGCTGCTGGAGGGGCTGGGGCACGCCAACGTGCTGCAG 

CTCTATGGCTACTGCTACCAGGACAGCGAGGACATCCCAGACACCCTGACCACCATCACG 

GAGCTGGGCGCCCCTGTAGAAATGATCCAGCTGCTGCAAACTTCCTGGGAGGATCGATTC 

CGAATCTGCCTGAGCCTGGGCCGCCTCCTCCACCACCTGGCCCACTCCCCACTGGGCTCC 

GTCACTCTGCTGGACTTCCGCCCTCGGCAGTTTGTGCTGGTGGATGGGGAGCTCAAAGTG 

ACGGACCTGGATGACGCACGTGTGGAGGAGACGCCGTGTGCAGGCAGCACCGACTGCATA 

CTCGAGTTTCCGGCCAGGAACTTCACCCTGCCCTGCTCAGCCCAGGGCTGGTGCGAGGGG 

ATGAACGAGAAGCGGAACCTCTATAATGCCTACAGGTTTTTCTTCACATACCTCCTGCCT 

CACAGTGCCCCGCCTTCACTGCGTCCTCTGCTGGACAGCATCGTCAACGCCACAGGAGAG 

CTCGCCTGGGGGGTGGACGAGAGCCTGGCCCAGGTGGAGAAGGTGGTGCACCTGTACCGG 

AGCGGGCAGTATCTGCAGAACTCCACGGCAAGCAGCAGTACCGAGTACCAGTGTATCCCA 

GACAGCACCATCCCGCAGGAAGACTACCGCTGCTGGCCATGCTACCACCACGGGAGCTGC 

CTCCTTTCAGTGTTCAACCTGGCTGAGGCTGTGGATGTCTGTGAGAGCCATGCCCAGTGT 

CGGGCCTTTGTGGTCACCAAGCAGACCACCTGGACAGGTCGGCAGCTGGTCTTTTTCAAG 

ACTGGATGGAGCCAAGTGGTCCCTGATCCCAACAAGACCAGATATGTGAAGGGCTCTGGG 

TGACCTATCTGAGGGCTCGGGTGACCAGCTGACTATCGTGAGCAGGTGGGCTTGCGTGTG 

GAGGGAGTGACTTGCACTGGCAGCACTGCATGTCACCTGGGAACCCGTGCAGACAAAGCT 

AACATCCCAGACAGACAGATGTGACCAGGACAAACGTGCAATAATGCCAAATGTTAAAAT 

GTGAGTTTAGCAGCCTAGCTATGGGACTGCTGGCTCCTAGTGGAGGAATCATGGGGGTAT 

GACTGCCTCTCGAACCGTGTGGGCTGTAAGCAAGCTCAGGGTAGTCTCCCCACTGGGGGG 

TGTGCCCCTCCCTGGGACGGTTCCGTGGGCAGGCCGATCACTGTGTTCAATAGTGTGAGA 

ATGTAGCTAAAGCCCCTGCTGCTGCTGCTGCACATGGCACAGCAGGGGGTGGGGGCTGGG 

TGGGGACAATCCATCGTGGAGTGTTGTCTCAGGTTAGGTGTGGACAGGAGAGTTGGCGGG 

AGATGCTCCAGGATGTGGGTGATTCTGTACCTGGGGAGGCTATCTCTGACCTCCCGACAG 

GGGACACTCCCAGGCCAGCCCAGGGGTCAGGGGCAGAGGTGGACACCTCAGCATGAGCCA 

AGACTGGGGTCAGGGAGCAGGTGTGGTTTGAGCCAGGAGCTGGGGGGGGGGTGGGGGCGG 

GGCCTTTCTGCCTCATTTGCTTTCAATGAAAGCGTCAAAGCAGCCAAAACCAGGCTTTCC 

CCCTTGCTGGAGTTTGAATATGCAGAATCTTTTGTAGTTCTTGTTGGTTAAATTGTTTAT 

TTTTGTAAAAAATAAAATAAAATTAGTTAATA7WVATGATGTTTCACAGCAAACTCTTCCC 
T 

SEQ ID NO: 107_AA396601_M

CCACGCGTCCGGGCTGCGCCGCGCTCCGCAACGTGTCTGGCGCGCAGTACGTGGGCTCAG 
GGTACAGTAAGGCTGTGTACCGGGTCCGCCTGCCCGGGGGCGCCGCGGTGGGGGTTAAAG 
CAGTGGAGTTGAGCGGCCAGGATGTGGGCAGGTGGGTGCGGGAGTTCGGGGCGCGAAGGG 
GCTGCTATCGCCTGGCGGGCCACAAGCTGCTCAAAGAGATGGTGCTGCTGGAGCGGCTGC 
GGCACCCCJ\ACGTGCTGCAGCTCTATGGCTATTGCTACCAGGACAGTGAGGGCATCCCAG 
ACACGCTGACCACCATCACAGAGCTGGGTGCCCCTGTGGAGATGATCCAGCTGTTGCAGA 
CTTCCTGGGAGGATCGATTCCGAATCTGCCTCAGCCTTGGCCGCCTCCTCCACCACCTGG 
CCCACTCCCCGCTGGGCTCGGTCACCCTGCTTGACTTCCGCCCTCGGCAGTTTGTGCTAG 
TGAACGGGGAGCTGAAAGTGACAGACCTGGATGATGCCCGCGTGGAAGAGACACCGTGCA 
CCAGCAGTGCCGACTGCACGCTAGAGTTTCCAGCCAGGAACTTCAGCCTGCCCTGCTCGG
CCCAGGGCTGGTGCGAGGGCATGAATGAGAAACGGAACCTCTACAATGCCTACAGGTTCT 
TCTTCACATACCTCCTGCCACACAGTGCCCCGCCTTCCCTCCGACCTCTCCTGGATAGCA 
TCGTCAATGCCACGGGAGAGCTCGCCTGGGGGGTGGATGAGACCCTGGCCCAGCTGGAGA 
CAGCGCTACACTTGTTCCGAAGTGGGCAGTACCTGCAGAACTCTACAAGCAGCAGGGCTG 
AGTACCAGCGCATCCCGGACAGTGCCATCACACAGGAGGACTATCGCTGCTGGCCATCCT 
ATCACCACGGCGGGTGCCTCCTGTGCGTGTTCAACCTGGCTGAGGCTATAGATGTGTGTG 
AGAGCCATGCTCAGTGTCGTGCCTTTGTGGTCACCAACCAGACCACCTGGACAGGTCGGA 
AGCTGGTCTTTTTTAAGACTGGATGGAACCAAGTGGTCCCTGATGCCGGCAAGACCACAT 
ATGTGAAGGCCCCTGGTTGACTGGTTGTGGGCTCAGCTGAGCAGCTGGGCTTGCCTGGTG 
CAGGCGTGACTTGCATCCCACCTGGGAACCCCTGCAGACAAAAGCTAGCTCCCAGAGCAA 
CTGATGTGACCAGGACAAAACGTGCAATATGCAAAAATGTTAAAATGTGAGTTTGCCAGC 
TTCAGTCCCAGACTGGTTGGAACCCGATTGCCTCTCTGGAGCTGTAGGCTGTGAGCAGGG 
CTCAGGCTGGTCTTAACTGGGACAGTGCCGTGGGCAGCCCATTACTGCATTCATGCTTTG 
AGAATGTAGCCAGAACACTGCTGCTGCATAAGCCACCGTGGGCAGGAGCTGCCTGGGGAC 
AACCAGTCTGAGAGTGCTCTCTCAGCTCAGCTGCGCTCCAAATGGAGAGCGCGGGATGCG 
GAGATGTGAGTGAACCAGCACTGGGAAGAAGGCTCTCGGGCCTCTCCCTAGAGGTTGCTC 
CTAGGCCAGCCCCGAGGCCGTGGGCAGCAGTGGTCGCATCCATATGAGCCAAGACTAGAG 
TGGAGGAGCAGATTGCATTTGAGCCAGGACTGGGGTGGGGGTAGGGTCGGGGGCTCTCTG 
CCTCATTTGGTTTCAGTGAAAGCGAGGGAGCAGCCGCAGCCAGGCTCCTCCCACTCGTGG 
AGGCCAGGCTCCTCCCCCTCCTGGAGGCCAGGCTCCTCCCCCTCCTGGAGTTTGCGTACC 
CAGAAGCTTTTATACTTCTCGTTCATTAAATTGTTTATTTTTGTAAAAAAAAAAATTAAT 
CAATTAATAAAATGATGTTTTGTGAC 

SEQ ID NO: 108_VRK3_H 

ATGATCTCCTTCTGTCCAGACTGTGGCAAAAGTATCCAAGCGGCATTCAAATTCTGGCCG 
TACTGTGGAAATTCTTTGCCTGTAGAGGAGCATGTAGGGTCCCAGACCTTTGTCAATCCA 
CATGTGTCATCCTTCCAAGGCTCAAAGAGAGGGCTGAACTCCAGTTTTGAAAGGTCTCCT 
AAGAAAGTGAAATGGTCCAGCACCGTCACCTCTCCCCGATTATCCCTCTTCTCAGATGGT 
GACAGTTCTGAGTCTGAAGATACTCTGAGTTCCTCTGAGAGATCCAAAGGCTCCGGGAGC 
AGACCCCCAACCCCCAAAAGCAGCGCTCAGAAGACCAGGAAGAGCCCTCAGGTGACGAGG 
GGTAGCCCTCAGAAGACCAGCTGTAGCCCTCAGAAGACCAGGCAGAGCCCTCAGACGCTG 
AAGCGGAGCCGAGTGACCACCTCACTTGAAGCTTTGCCCACAGGGACAGTGCTGACAGAC 
AAGAGTGGGCGACAGTGGAAGCTGAAGTCCTTCCAGACCAGGGACAACCAGGGCATTCTC 
TATGAAGCTGCACCCACCTCCACCGTCACCTGTGACTGAGGACCACAGAAGCAAAAGTTC 
TCACTCAAACTGGATGGCAAGGATGGGCGCTTGTTCAATGAGCAGAAGTTCTTCCAGCGG 
GCCGCCAAGCCTCTGCAAGTCAACAAGTGGAAGAAGCTGTACTCGACCCCACTGCTGGCC 
ATCCCTACCTGCATGGGTTTCGGTGTTCACCAGGACAAATACAGGTTCTTGGTGTTACCC 
AGCCTGGGGAGGAGCCTTCAGTCGGCCCTGGATGTCAGCCCAAAGCATGTGCTGTCAGAG 
AGGTCTGTGCTGCAGGTGGCCTGCCGGCTGCTGGATGCCCTGGAGTTCCTCCATGAGAAT 
GAGTATGTTCATGGAAATGTGACAGCTGAAAATATCTTTGTGGATCCAGAGGACCAGAGT 
CAGGTGACTTTGGCAGGCTATGGCTTCGCCTTCCGCTATTGCCCAAGTGGCAAACACGTG 
GCCTACGTGGAAGGCAGCAGGAGCCCTCACGAGGGGGACCTTGAGTTCATTAGCATGGAC 
CTGCACAAGGGATGCGGGCCCTCCCGCCGCAGCGACCTCCAGAGCCTGGGCTACTGCATC
CTGAAGTGGCTCTACGGGTTTCTGCCATGGACAAATTGCCTTCCCAACACTGAGGACATC 
ATGAAGCAAAAACAGAAGTTTGTTGATAAGCCGGGGCCCTTCGTGGGACCCTGCGGTCAC 
TGGATCAGGCCCTCAGAGAGCCTGCAGAAGTACCTGAAGGTGGTGATGGCCCTGACGTAT 
GAGGAGAAGCCGCCCTACGCCATGCTGAGGAACAACCTAGAAGCTTTGCTGGAGGATCTG 
CGTGTGTCTCCATATGACCCCATTGGCCTCCCGATGGTGCCCTAG

SEQ ID NO: 109_S71575_M VRK3_M

CCATCCCCACCTGTATCGGCTTTGGCATTCACCAGGACAAGTACAGGTTCCTAGTATTCC 
CCAGCCTGGGGAGGAGCCTTCAGTCAGCCCTGGATGACAACCCAAAGCATGTGGTATCAG 
AGAGATGTGl'GCTTCAGGTGGCCTGCAGGCTGCTGGATGCTCTGGAGTATCTCCATGAAA 
ATGAGTATGTTCACGGGAACCTGACAGCTGAGAATGTCTTTGTGAATCCAGAGGATCTGA 
GCCAGGTGACCCTGGTGGGCTATGGCTTCACCTACCGATACTGCCCAGGTGGCAAACACG 
TGGCCTACAAAGAAGGCAGGAGGAGTCCACACGATGGGGACTTGGAGTTCATTAGCATGG 
ACCTGCACAAGGGATGCGGACCCTCCCGCCGCAGCGATCTCCAGACCTTGGGCTACTGTA 
TGCTCAAGTGGCTTTATGGGTCCCTGCCATGGACAAATTGCCTTCCCAACACCGAAAAGA 
TAACTAGGCAGAAGCAGAAGTATCTGGACAGCCCCGAGCGCCTCGTGGGACTGTGTGGCC 
GCTGGAACAAGGCCTCAGAGACCCTGCGGGAGTACCTGAAGGTGGTGATGGCCCTCAATT 
ATGAGGAGAAGCCACCCTATGCCACGCTGAGGAACAGCCTAGAAGCTCTGCTGCAGGATA 
TGCGGGTGTCACCCTATGACCCTCTGGACCTCCAGATGGTGCCTTAGATGGAATCCAGAG 
CTTCCGACTTGCAGCTTGAAGTAGAACATGAAGTAGTGTGACTGGAGGCCTGTTTGAACT 
CATAGCTCCTAAAAGAATCCCTTGAATGTGCATTCTCACCGCTCCCTTAGGACATATGAA 
TCAGCACTTGTGTTGGGGAACCTGAGTCATGTCATGTAATGTGAAACTCCTCCCTGTCTC 
AGCTCTGGCAGCTGTGGATGGAGGTAAGTGGATGCTGGCGGCGGCGGCGGCAGCAGCCAC 
TCCACTCCCTATGGCATTTCTGTGATGGCATAATAAACTGTTTTTAATC 

SEQ ID NO: 110_AA45427_H

ATGGGCCACGCGCTGTGTGTCTGCTCTCGGGGAACTGTCATCATTGACAATAAGCGCTAC 
CTCTTCATCCAGAAAGTGGGGGAGGGTGGGTTCAGCTATGTGGACCTAGTGGAAGGGTTA 
CATGATGGACACTTCTACGCCCTGAAGCGAATCCTGTGTCACGAGCAGCAGGACCGGGAG 
GAGGCCCAGCGAGAAGCCGACATGCATCGCCTCTTCAATCACCCCAACATCCTTCGCCTC 
GTGGCTTACTGTCTGAGGGAACGGGGTGCTAAGCATGAGGCCTGGCTGCTGCTACCATTC 
TTCAAGAGAGGTACGCTGTGGAATGAGATAGAAAGGCTGAAGGACAAAGGCAACTTCCTG 
ACCGAGGATCAAATCCTTTGGCTGCTGCTGGGGATCTGCAGAGGCCTTGAGGCCATTCAT 
GCCAAGGGTTATGCCCACAGAGACTTGAAGCCCACCAATATATTGCTTGGAGATGAGGGG 
CAGCCAGTTTTAATGGACTTGGGTTCCATGAATCAAGCATGCATCCATGTGGAGGGCTCC 
CGCCAGGCTCTGACCCTGCAGGACTGGGCAGCCCAGCGGTGCACCATCTCCTACCGAGCC 
CCAGAGCTCTTCTCTGTGCAGAGTCACTGTGTCATCGATGAGCGGACTGATGTCTGGTCC 
CTAGGCTGCGTGCTATATGCCATGATGTTTGGGGAAGGCCCTTATGACATGGTGTTCCAA 
7VAGGGTGACAGTGTGGCCCTTGGTGTGCAGAACCAACTCAGCATCCCACAAAGCCCCAGG 
CATTCTTCAGCATTGCGGCAGCTCCTGAACTCGATGATGACCGTGGACCCGCATCAGCGT 
CCTCACATTCCTCTCCTCCTCAGTCAGGTGGAGGCGCTGCAGCCCCCAGCTCCTGGCCAA 
CATACTACCCAAATCTGA 

SEQ ID NO: 111_H05721_H 

CCCTGAGGCACCGCCCCAAGTTTGGTGTGACCGGCGGGGGACGCCGGTGGTGGCGGCAGC 
GACGGCTGCGGGGGCACCGGGCCGCGGCGCCACCATGGCGGTGCGACAGGCGCTGGGCCG 
CGGCCTGCAGCTGGGTCGAGCGCTGCTGCTGCGCTTCAGGGGCAAGCCCGGCCGGGCCTA 
CGGCTTGGGGCGGCCGGGCCCGGCGGCGGGCTGTGTCCGCGGGGAGCGTCCAGGCTGGGC 
CGCAGGACCGGGCGCGGAGCCTCGCAGGGTCGGGCTCGGGCTCCCTAACCGTCTCCGCTT 
CTTCCGCCAGTCGGTGGCCGGGCTGGCGGCGCGGTTGCAGCGGCAGTTCGTGGTGCGGGC 
CTGGGGCTGCGCGGGCCCTTGCGGCCGGGCAGTCTTTCTGGCCTTCGGGCTAGGGCTGGG 
CCTCATCGAGGAAAAACAGGCGGAGAGCCGGCGGGCGGTCTCGGCCTGTCAGGAGATCCA 
GGCAATTTTTACCCAGAAAAGCAAGCCGGGGCCTGACCCGTTGGACACGAGACGCTTGCA 
GGGCTTTCGGCTGGAGGAGTATCTGATAGGGCAGTCCATTGGTAAGGGCTGCAGTGCTGC 
TGTGTATGAAGCCACCATGCCTACATTGCCCCAGAACCTGGAGGTGACAAAGAGCACGGG 
GTTGCTTCCAGGGAGAGGCCCAGGTACCAGTGCACCAGGAGAAGGGCAGGAGCGAGCTCC 
GGGGGCCCCTGCCTTCCCCTTGGCCATCAAGATGATGTGGAACATCTCG'jCAGGTTCCTC

CAGCGAAGCCATCTTGAACACAATGAGCCAGGAGCTGGTCCCAGCGAGCCGAGTGGCCTT 

GGCTGGGGAGTATGGAGCAGTCACTTACAGAAAATCCAAGAGAGGTCCCAAGCAACTAGC 

CCCTCACCCCAACATCATCCGGGTTCTCCGCGCCTTCACCTCTTCCGTGCCGCTGCTGCC 

AGGGGCCCTGGTCGACTACCCTGATGTGCTGCCCTCACGCCTCCACCCTGAAGGCCTGGG 

CCATGGCCGGACGCTGTTCCTCGTTATGAAGAACTATCCCTGTACCCTGCGCCAGTACCT 

TTGTGTGAACACACCCAGCCCCCGCCTCGCCGCCATGATGCTGCTGCAGCTGCTGGAAGG 

CGTGGACCATCTGGTTCAACAGGGCATCGCGCACAGAGACCTGAAATCCGACAACATCCT 

TGTGGAGCTGGACCCAGACGGCTGCCCCTGGCTGGTGATCGCAGATTTTGGCTGCTGCCT 

GGCTGATGAGAGCATCGGCCTGCAGTTGCCCTTCAGCAGCTGGTACGTGGATCGGGGCGG 

AAACGGCTGTCTGATGGCCCCAGAGGTGTCCACGGCCCGTCCTGGCCCCAGGGCAGTGAT 

TGACTACAGCAAGGCTGATGCCTGGGCAGTGGGAGCCATCGCCTATGAAATCTTCGGGCT 

TGTCAATCCCTTCTACGGCCAGGGCAAGGCCCACCTTGAAAGCCGCAGCTACCAAGAGGC 

TCAGCTACCTGCACTGCCCGAGTCAGTGCCTCCAGACGTGAGACAGTTGGTGAGGGCACT 

GCTCCAGGGAGAGGCCAGCAAGAGACCATCTGCCCGAGTAGCCGGAAATGTGCTTCATCT 

AAGCCTCTGGGGTGAACATATTCTAGCCCTGAAGAATCTGAAGTTAGACAAGATGGTTGG 

CTGGCTCGTCCAACAATCGGCCGCCACTTTGTTGGCCAACAGGCTCACAGAGAAGTGTTG 

TGTGGAAACAAAAATGAAGATGCTCTTTCTGGGTAACCTGGAGTGTGAAACGCTCTGCCA 

GGCAGCCGTCCTCGTCTGGTCATGGAGGGCAGCCCTGTGATGTCCCTGCATGGAGCTGGT 

GAATTACTAAAAGAACATGGCATCCTCTGTGTCGTGATGGTCTGTGAATGGTGAGGGTGG 

GAGTCAGGAGACAAGACAGGGCAGAGAGGGCTGGTTAGCCGGAAAAGGCCTCGGGCTTGG 

CAAATGGAAGAAGTTGAGTGAGAGTTCAGTGTGCAGTCCTCTGCTCACAGACATCTGAAA 

AGTGAATGGCCAAGGTGGTGTAGTAGATGAGGGTGGACTGAGGAGGGGTAGGGCTGCATC 

CACAGAGAGGATCCAGGGCAAGGCAGTGGCTGTCAGTGGGAGAGTTTGGGTGTGACCTTT 

GCCCCTAACACGAGGAACTCGTTTGAAGGGGGCAGGGTAGCATGTCTGATTTGGGACGTG 

GATGAAGGGAGAGATCAACATGGGTCAGCACGTTCAGTTAGGGGAGTGGGAAATTACATG 

AGGCCTGGGCGTCTGCGTTCCCAAGCTGTGCGTTCTGGACCAGCTACTGTATTATTAATG

TCACTTAGCGAAAGTGACGGATGAGGAGTAAGTAAGTAAGTGTGGGGATTTAAAGTTGAG 

GGTTTCCGTGGTGAGTAGCCTCTCTTACAGGAATTGTGAAATATTAAATGCAAATTTACA 

AGTGGAGATGACGTATGTGCCTTGAAGTGAATATTTGGCTTTAAGAATGATTCTTCTTAT 

ACTGTGAAGGTGAGAATATTTTGTGGGCAGGTATCAACATTGGGGAAGAGATTTCATGTG 

TAAGTAAGTAACTTTATACATGATTTTTAGGAAGCTATTGCCTAAATCAGCGTCAACATG 

CAGTAAAGGTTGTCTTCAACTGAGAAAA 

SEQ ID NO: 112_AI086865_H 

AATGAGATGGAGAAGTACGAGCGGATCCGAGTGGTGGGGAGAGGTGCCTTGGGGATTGTG 
CAGCTGTGGCTGCGAAAGGCTGACCAGAAGCTGGTGATCATCAAGCAGATTCCAGTGGAA 
CAGATGACCAAGGAAGAGCGGCAGGCAGCCCAGAATGAGTGCCAGGTCCTCAAGCTGCTC 
AACCACCCCAATGTCATTGAGTACTACGAGAACTTCCTGGAAGACAAAGCCCTTATGATC 
GCCATGGAATATGCACCAGGCGGGACTGTGGGTGAGTTCATGCAAAAGCGCTGTAATTCG 
CTGCTGGAGGAGGAGACCATCCTGCACTTCTTCGTGCAGATCCTGCTTGCACTGCATCAT 
GTGCACACCCACCTCATCCTGCACCGAGACCTCAAGACCCAGAACATCCTGCTTGACAAA 
CACCGCATGGTCGTCAAGATCGGTGATTTCGGCATCTCCAAGATCCTTAGCAGCAAGAGC 
ACCCCATGCTATATCTCCCCTGAGCTGTGTGAGGGCAAGCCCTACAACCAGAAGAGTGAC 
ATCTGGGCCCTGGGCTG1GTCCTCTACGAGCTGGCCAGCCTCAAGAGGGCTTTCGAGGCT 
GCGAACTTGCCAGCACTGGTGCTGAAGATCATGAGTGGCACCTTTGCACCTATCTCTGAC 
CGGTACAGCCCTGAGCTTCGCCAGCTGGTCCTGAGTCTACTCAGCCTGGAGCCTGCCCAG 
CGGCCACCACTCAGCCACATCATGGCACAGCCCCTCTGCATCCGTGCCCTCCTCAACCTC 
CACACCGACGGCAGAGAAGTCCGTGGCCCCCAGCAACACAGGGAGCAGGACCACCAGTGT 
CCGCTGCA(jAGAGGCATCATCATGACATTCGGCAGCGGCAGCAATGGGTGCCTAGGCCAT 

GGCAGCCTCACTGACATCAGCCAGCCCACCATTGTGGAGGCTTTGTTGGGCTATGAAATG 

GTGCAGCAAGTGGAGGAGGCCCTGAGCTTCACACTACTAGGCTCTGCACCCCTGGACCAG 

GAGCCTCTGCTGAGTATAGACCTGGGCACTGCTCACTCAGCTGCTGTGACTGGTGAGGAG 

GACTTGGGCTCTGGAGATGTAAACAGGTTACCCAGCTGGGAGAGAGGACATCTGCTGGCT 

GGTGTGGCGTCCAGCACTGATGTGTCTACCTTCTCTGAAGGTGACTGCAAGGAGCCTGAC 

AAGTGCTGCTGGAGACACAAGCAGTGCACTGGGCACATCATCTACCCTTTCGCCTCTGAr 

TGTGTCCGCCACAGCCTGCACCTACACTCTGTCAACCACTGCAACTGTAATTCTAGGCTG 

AAGGACTCTTCAGAGGATAGCAGCAGCTCCCGGGGCGCGGGCCCT^ACCTGCTCCCATGTC 

ATCGAGTCCCCTTGCTTTGAGCTCACACCGGAGGAGGAGCATGTGGAGCGATTCCGGTAT 

GGCTGGTGCAAAAGCTACAGACCTGTCTCTGTGGCAGTGATCCACCATCCACTCTACCAT 

GAGTGTGGGGCAGATGATCTAAATGXXAAGAAGAGGAAGAGGAGGAGGAGGAAAAGCAAG 

CCCCCCATCCCGACACAGGTGGGGCCCGCCACCGCCTCCCCTGACCTAGGCACCAGCATG 

GCCACTGGTACCCCTGACTCCACAGCGCCCATCACCATCTGGCGCTCTGAGAGCCCCACA 

GGGAAGGGTCAGGGCAGCAAGGTGATCAAGAAGGTA^AGAAGAAAAAGGAAAAAGAGAAA 

GACAAGGAGGAGATGGATGAGAAGGCAAAGCTGAAGAAJWLAGCCAAGAAAGGCCAGTTG 

ACTAAGAAGAAAAGCCCGGTTAAATTGGAGCCTTCCCCGCCAGACGTGAGCCGATCATTA 

AGCGCAAGACAGCTGGCCAGGATGTCCGAGTCCAGCCCAGAAAGCCGGGAAGAGCTGGAG 

AGCGAGGACAGTTACAATGGCCGGGGGCAGGGAGAACTGTGCAGCGAGGATATTGTGGAA 

TCATCATCGCCCAGGAAGAGAGAGAACACAGTCCAGGCCAAAAAGACAGGGGCAAAGCCC 

TCACAAGCCAGGAAGGTAAACAAGAGAAAATCTCCCCCAGGATCAAACCCCAACCTCAGT 

TGAGGCCAGGGTGGTCAGGGTGCAGAATAAATGCCATCGAGCCTGTGGCTGGCCCTCTGC 

TGCTGTTCTCTCCCTCCAACCTGGCTGTTTCTTGCGGGGCAAGGGGTGGGCTCAGGGCTG 

CAGGGGTTTCTCAAAGGCAATCCAGCTTTCACAAAGGAAGCCCATGGGAAGGCAGGTGGG 

AGGGAAAGGAAGGGGCACAGCCCTATTTCTTCCTACCTGCTAGGACAAGGTGGAAGAGTG 

TATCTGGGGTGGGAAGGAGGGCTTCCCCTCTCTGCTGCGAGAGACTGGTGTGTGTGAAAT 

CCACTTCTGGGACAGGCAGTACTGTCTGCAGCGATACCCCCAATAAACGGAACTTTTTAA 

ccc 

SEQ ID NO: 113_AA836348_H 

ATGTCGGTGCTGGGCGAGTACGAGCGACACTGCGATTCCATCAACTCGGACTTTGGGAGC 

GAGTCCGGGGGTTGCGGGGACTCGAGTCCGGGGGCTAGCGCCAGTCAGGGGCCGCGAGCC 

GGCGGCGGCGCGGCGGAGCAGGAGGAACTGCACTACATGCCCATCCGCGTCCTGGGCCGC 

GGCGCCTTCGGGGAAGCCACGCTGTACCGCCGCACCGAGGATGACTGACTGGTTGTGTGG 

AAGGAAGTCGATTTGACCCGGCTGTCTGAGAAGGAACGTCGTGATGCCTTGAATGAGATA 

GTTATTCTGGCACTGCTGCAGCACGACAACATTATTGCCTACTACAATCACTTCATGGAC 

AATACCACGCTGCTGATTGAGCTGGAATATTGTAATGGAGGGAACCTGTATGACAAuAATC 

CTTCGTCAGAAGGACAAGTTGTTTGAGGAAGAGATGGTGGTGTGGTACCTATTTCAGATT 

GTTTCAGCAGTGAGCTGCATCCATAAAGCTGGAATCCTTCATAGAGATATAAAGACATTA 

AATATTTTTCTGACCAAGGCAAACCTGATAAAACTTGGAGATTATGGCCTAGCAAAGAAA 

CTTAATTCTGAGTATTCCATGGCTGAGACGCTTGTGGGAACCCCATATTACATGTCTCCA 

GAGCTCTGTCAAGGAGTAAAGTACAATTTGAAGTCTGATATCTGGGCAGTTGGGTGCGTC 

ATTTTTGAACTGCTTACCTTAAAGAGGACGTTTGATGCTACAAACCCACTTAACCTGTGT 

GTGAAGATCGTGCAAGGAATTCGGGCCATGGAAGTTGACTCTAGCCAGTACTCTTTGGAA 

TTGATCCAAATGGTTCATTCGTGCCTTGACCAGGATCCTGAGCAGAGACCTACTGCAGAT 

GAACTTCTAGATCGCCCTCTTCTCAGGAAACGCAGGAGGTCAAGCACTGTGACTGAAGCA 

CCCATTGCTGTAGTAACATCACGAACCAGTGAAGTCTATGTTTGGGGTGGTGGAAAATCC 

ACCCCCCAGAAACTGGATGTTATCAAGAGTGGCTGTAGTGCCCGGCAGGTCTGTGCAGGG 

AATACCCACTTTGCTGTGGTCACAGTGGAGAAGGAACTGTACACTTGGGTGAACATGCA/^ 

GGAGGCACTAAACTCCATGGTCAGCTGGGCCATGGAGACAAAGCCTCCTATCGACAGCCA 

AAGCATGTGGAAAAGTTGCAAGGCAAAGCTATCCGTCAGGTGTCATGTGGTGATGATTTC 
ACTGTCTGTGTGACTGATGAGGGTCAGCTCTATGCCTTCGGATCAGATTATTATGGCTGC 

ATGGGGGTGGACAAAGTTGCTGGCCCTGAAGTGCTAGAACCCATGCAGCTGAACTTCTTC 

CTCAGCAATCGAGTGGAGCAGGTCTGGTGTGGAGATAATCATGTGGTGGTTGTGAC/^CGA 

AACAAGGAAGTCTATTCTTGGGGCTGTGGCGAATATGGACGAGTGGGTTTGGATTCAGAA 

GAGGATTATTATACACCACAAAAGGTGGATGTTCCCAAGGGGTTGATTATTGTTGCAGTT 

CAATGTGGCTGTGATGGGACATTTCTGTTGACCCAGTCAGGCAAAGTGCTGGCCTGTGGA 

CTCAATGAATTCAATAAGCTGGGTCTGAATCAGTGCATGTCGGGAATTATCAACCATGAA 

GCATACCATGAAGTTCCGTACACAACGTCCTTTACCTTGGCCAAACAGTTGTGCTTTTAT 

AAGATCCGTACCATTGCCCCAGGCAAGACTCAGACAGGTGCTATTGATGAGCGAGGCCGG 

CTGCTGACCTTTGGCTGCAACAAGTGTGGGCAGCTGGGCGTTGGGAACTACAAGAAGCGT 

CTGGGAATCAACCTGTTGGGGGGACCCCTTGGTGGGAAGCAAGTGATCAGGGTCTCCTGC 

GGTGATGAGTTTACCATTGCTGCCACTGATGAGAAAGTATTGAATTCTAAGACCATCCGT 

TCCAATAGCAGTGGCTTATCCATTGGAACTGTGTTTCAGAGCTCTAGCCCGGGAGGAGGC 

GGCGGGGGCGGCGGTGGTGAAGAAGAGGACAGTCAGCAGGAATCTGAAACTCCTGACCCA 

AGTGGAGGCTTCCGAGGAACAATGGAAGCAGACCGAGGAATGGAAGGTTTAATCAGTCCC 

ACAGAGGCCATGGGGAACAGTAATGGGGCCAGCAGCTCCTGTCCTGGCTGGCTTCGAAAG 

GAGCTGGAAAATGCAGAATTTATCCCCATGCCTGACAGCCCATCTCCTCTCAGTGCAGCG 

TTTTCAGAATCTGAGAAAGATACCCTGCCCTATGAAGAGCTGCAAGGACTCAAAGTGGCtr 

TCTGAAGCTCCTTTGGAACACAAACCCCAAGTAGAAGCCTCGGTAACTGAGCTTTTTGCC 

TTTGAATCACAACTAGTCACCTCGGCTGAATCCTGCAGTAACCTGTGCTGGGAAGGGAAC 

ACCACTGACTCCTCCTGCGTGTGCGTGCAGCTCTCTGCAGGTGGAGGTTGA 

SEQ ID NO: 114_R8666e_H, MKK6_H 

ATGAACTTGCTGCTCTCCTACCGCGATGTGCAGGACTACTCGGCCATCATTGAGCTGGTG 

GAGACGCTGCAGGCCTTGCCCACCTGTGATGTGGCCGAGCAGCATAATGTCTGCTTCCAC 

TACACTTTTGCCCTCAACCGGAGGAACAGGCCTGGGGACCGGGCGAAGGCCCTGTCTGTG 

CTGCTGCCGCTGGTACAGCTTGAGGGCTCTGTGGCGCCCGATCTGTACTGCATGTGTGGC 

CGTATCTACAAGGACATGTTCTTCAGCTCGGGTTTCCAGGATGCTGGGCACCGGGAGCAG 

GCCTATCACTGGTATCGCAAGGCTTTTGACGTAGAGCCCAGCCTTCACTCAGGCATCAAT 

GCAGCTGTGCTCCTCATTGCTGCCGGGCAGCACTTTGAGGATTCCAAAGAGCTCCGGCTA 

ATAGGCATGAAGCTGGGCTGCCTGCTGGCCCGCAAAGGCTGCGTGGAGAAGATGCAGTAT 

TACTGGGATGTGGGTTTCTACCTGGGAGCCCAGATCCTCGCCAATGACCCCACCCAGGTG 

GTGCTGGCTGCAGAGCAGCTGTATAAGCTCAATGCCCCCATATGGTACCTGGTGTCCGTG 

ATGGAGACCTTCCTGCTCTACCAGCACTTCAGGCCCACGCCAGAGCCCCCTGGAGGGCCA 

CCACGCCGTGCCCACTTCTGGCTCCACTTCTTGCTACAGTCCTGCCAACCATTCAAGACA 

GCCTGTGCCCAGGGCGACCAGTGCTTGGTGCTGGTCCTGGAGATGAACAAGGTGCTGCTG 

CCTGCAAAGCTCGAGGTTCGGGGTACTGACCCAGTAAGCACAGTGACCCTGAGCCTGCTG 

GAGCCTGAGACCCAGGACATTCCCTCCAGCTGGACCTTCCCAGTCGCCTCCATATGCGGA 

GTCAGCGCCTCAAAGCGCGACGAGCGCTGCTGCTTCCTCTATGCACTCCCCCCGGCTCAG 

GACGTCCAGCTGTGCTTCCCCAGCGTAGGGCACTGCCAGTGGTTCTGCGGCCTGATCCAG 

GCCTGGGTGACGAACCCGGATTCCACGGCGCCCGCGGAGGAGGCGGAGGGCGCGGGGGAG 

ATGTTGGAGTTTGATTATGAGTACACGGAGACGGGCGAGCGGCTGGTGCTGGGCAAGGGC 

ACGTATGGGGTGGTGTACGCGGGCCGCGATCGCCACACGAGGGTGCGCATCGCGATCAAG 

GAGATCCCGGAGCGGGACAGCAGGTTCTCTCAGCCCCTGCATGAAGAGATCGCTCTTCAC 

AGACGCCTGCGCCACAAGAACATAGTGCGCTATCTGGGCTCAGCTAGCCAGGGCGGCTAG 

CTTAAGATCTTCATGGAGGAAGTGCCTGGAGGCAGCCTGTCCTCCTTGCTGCGGTCGGTG 

TGGGGACCCCTGAAGGACAACGAGAGCACCATCAGTTTCTACACCCGCCAGATCCTG 2AG 

GGACTTGGCTACTTGCACGACAACCACATCGTGCACAGGGACATAAAAGGGGACAATGTi ? 

CTGATCAACACCTTCAGTGGGCTGCTCAAGATTTCTGACTTGGGCACCTCCAAGCGGCTCt 

GCAGGCATCACACCTTGCACTGAGACCTTCACAGGAACTCTGCAGTATATGGCCCCAGAA 
ATCATTGACCAGGGCCCACGCGGGTATGGGAAAGCAGCTGACATCTGGTCACTGGGCTGC 
ACTGTCATTGAGATGGCCACAGGTCGCCCCCCCTTCCACGAGCTCGGGAGCCCACAGGCT 
GCCATGTTTCAGGTGGGTATGTACAAGGTCCATCCGCCAATGCCCAGCTCTCTGTCGGCC 
GAGGCCCAAGCCTTTCTCCTCCGAACTTTTGAGCCAGACCCCCGCCTCCGAGCCAGCGCC 
CAGACACTGCTGGGGGACCCCTTCCTGCAGCCTGGGAAAAGGAGCCGCAGCCCCAGCTCC 
CCACGACATGCTCCACGGCCCTCAGATGCCCCTTCTGCCAGTCCCACTCCTTCAGCCAAC 
TCAAGCACCCAGTCTCAGACATTCCCGTGCCCTCAGGCACCCTCTGAGCACCCACCCAGC 
CCCCCGAAGCGCTGCCTCAGTTATGGGGGCACCAGCCAGCTCCGGGTGCCCGAGGAGGCT 
GCGGCCGAGGAGCCTGCGTCTCCGGAGGAGAGTTCGGGGCTGAGCCTGCTGCACCAGGAG 
AGCAAGCGTCGGGCCATGCTGGCCGCAGTATTGGAGCAGGAGCTGCCAGCGCTGGCGGAG 
AATCTGCACCAGGAGCAGAAGCAAGAGCAGGGGGCCCGTCTGGGCAGAAACCATGTGGAA 
GAGCTGCTGCGCTGCCTCGGGGCACACATCCACACTCCCAACCGCCGGCAGCTCGCCCAG 
GAGCTGCGGGCGCTGCAAGGACGGCTGAGGGCCCAGGGGCTTGGGCCTGCGCTTCTGCAC 
AGACCGCTGTTTGCCTTCCCGGATGCGGTGAAGCAGATCCTCCGCAAGCGCCAGATCCGT 
CCACACTGGATGTTCGTTCTGGACTCACTGCTCAGCCGTGCTGTGCGGGCAGCCCTGGGT 
GTGCTAGGACCGGAGGTGGAGAAGGAGGCGGTCTCACCGAGGTCAGAGGAGCTGAGTAAT 
GAAGGGGACTCCCAGCAGAGCCCAGGCCAGCAGAGCCCGCTTCCGGTGGAGCCCGAGCAG 
GGCCCCGCTCCTCTGATGGTGCAGCTGAGCCTCTTGAGGGCAGAGACTGATCGGCTGCGC 
GAAATCCTGGCGGGGAAGGAACGGGAGTACCAGGCCCTGGTGCAGCGGGCTCTACAGCGG 
CTGAATGAGGAAGCCCGGACCTATGTCCTGGCCCCAGAGCCTCCAACTGCTCTTTCAACG 
GACCAGGGCCTGGTGCAGTGGCTACAGGAAGTGAATGTGGATTCAGGCACCATCCAAATG 
CTGTTGAACCATAGCTTCACCCTCCACACTCTGCTCACCTATGCCACTCGAGATGACGTC 
ATCTACACCCGCATCAGGGGAGGGATGGTATGCCGCATCTGGAGGGCCATCTTGGCACAG 
CGAGCAGGATCCACACCAGTCACCTCTGGACCCTGA 

SEQ ID NO: 115_PAK6_H 

ATGTTTGGGAAGAAAAAGAAAAAGATTGAAATATCTGGCCCGTCCAACTTTGAACACAGG 
GTTCATACTGGGTTTGATCCACAAGAGCAGAAGTTTACCGGCCTTCCCCAGCAGTGGCAC 
AGCCTGTTAGCAGATACGGCCAACAGGCCAAAGCCTATGGTGGACCCTTCATGCATCACA 
CCCATCCAGCTGGCTCCTATGAAGACAATCGTTAGAGGAAACAAACCCTGCAAGGAAACC 
TCCATCAACGGCCTGCTAGAGGATTTTGACAACATCTCGGTGACTCGCTCCAACTCCCTA 
AGGAAAGAAAGCCCACCCACCCCAGATCAGGGAGCCTCCAGCCACGGTCCAGGCCACGCG 
GAAGAAAATGGCTTCATCACCTTCTCCCAGTATTCCAGCGAATCCGATACTACTGCTGAC 
TACACGACCGAAAAGTACAGGGAGAAGAGTCTCTATGGAGATGATCTGGATCCGTATTAT 
AGAGGCAGCCACGCAGCCAAGCAAAATGGGCACGTAATGAAAATGAAGCACGGGGAGGCC 
TACTATTCTGAGGTGAAGCCTTTGAAATCCGATTTTGCCAGATTTTCTGCCGATTATCAC 
TCACATTTGGACTCACTGAGCAAACCAAGTGAATACAGTGACCTCAAGTGGGAGTATCAG 
AGAGCCTCGAGTAGCTCCCCTCTGGATTATTCATTCCAATTCACACCTTCTAGAACTGCA 
GGGACCAGCGGGTGCTCCAAGGAGAGCCTGGCGTACAGTGAAAGTGAATGGGGACCCAGC 
CTGGATGACTATGACAGGAGGCCAAAGTCTTCGTACCTGAATCAGACAAGCCCTCAGCCC 
ACCATGCGGCAGAGGTCCAGGTCAGGCTCGGGACTCCAGGAACCGATGATGCCATTTGGA 
GCAAGTGCATTTAAAACCCATCCCCAAGGACACTCCTACAACTCCTACACCTACCCTCGC 
TTGTCCGAGCCCACAATGTGCATTCCAAAGGTGGATTACGATCGAGCACAGATGGTCCTC 
AGCCCTCCACTGTCAGGGTCTGACACCTACCCCAGGGGCCCTGCC7VAACTACCTCAAAGT 
CAAAGCAAATCGGGCTATTCCTCAAGCAGTCACCAGTACCCGTCTGGGTACCACAAAGCC 
ACCTTGTACCATCACCCCTCCCTGCAGAGCAGTTCGCAGTACATCTCCACGGCTTCCTAC 
CTGAGCTCCCTCAGCCTCTCATCCAGCACCTACCCGCCGCCCAGCTGGGGCTCCTCCTCC 
GACCAGCAGCCCTCCAGGGTGTCCCATGAACAGTTTCGGGCGGCCCTGCAGCTGGTGGTC 
AGCCCAGGAGACCCCAGGGAATACTTGGCCAACTTTATCAAAATCGGGGAAGGCTCAACC 
GGCATCGTATGCATCGCCACCGAGAAACACACAGGGAAACAAGTTGCAGTGAAGAAAATG 

GACCTCCGGAAGCAACAGAGACGAGAACTGCTTTTCAATGAGGTCGTGATCATGCG3GAT 
TACCACCATGACAATGTGGTTGACATGTACAGCAGCTACGTTGTGGGCGATGAGCTCTGG 
GTGGTCATGGAGTTTCTAGAAGGTGGTGCCTTGACAGACATTGTGACTCACACCAG.AATG 
AATGAAGAACAGATAGCTACTGTCTGCCTGTCAGTTCTGAGAGCTCTCTCCTACCTTCA': 1 
AACCAAGGAGTGATTCACAGGGACATAAAAAGTGACTCCATCCTCCTGACAAGCGATGGC 
CGGATAAAGTTGTCTGATTTTGGTTTCTGTGCTCAAGTTTCCAAAGAGGTGCCGAAGAGG 
AAATCATTGGTTGGCACTCCCTACTGGATGGCCCCTGAGGTGATTTCTAGGCTACCTTAT 
GGGAGAGAGGTGGACATCTGGTCCCTCGGGATCATGGTGATAGAAATGATTGATGGCGAG 
CCCCCCTACTTCAATGAGCCTCCCCTCCAGGCGATGCGGAGGATCCGGGACAGTTTACCT 
CCAAGAGTGAAGGAGCTACACAAGGTTTCTTCAGTGCTCCGGGGATTCCTAGACTTGATG 
TTGGTGAGGGAGCCCTCTCAGAGAGCAACAGCCCAGGAACTCCTCGGACATCCATTCTTA 
AAACTAGCAGGTCCACCGTCTTGCATCGTCCCCCTCATGAGACAATACAGGCATCACTGA 

SEQ ID NO: 116_SURTK106_H 

ATGAATGATAGGAATGAGATTCAAATGGAAGCCAAACTCCAAAGTCTTACCATTATAGCA 

CAGGAAATTCTATGCAGATTCTTTATTACCCTTAGGAGACATGCACGTTTCCTGCTCACT 

AAACTAGGAAGGCAAGGAATGGCAAGGTCAGGAATTACTCACAGCTGTGCTGTGTGCATT 

CTCTGTGGGCCTAGCAGGGAAGGGGACAGCCCTGTGGCAATGGGCATGACACGGATGCTC 

CTGGAATGCAGTCTCAGTGACAAGTTGTGTGTCATCCAGGAGAAGCAGTATGAAGTGATT 

ATCGTCCCAACTTTGTTGGTTACTATCTTCCTCATCCTTCTTGGGGTCATCCTGTGGCTT 

TTTATCAGAGAACAAAGAACTCAACAGCAGCGTTCTGGACCTCAAGGCATTGCCCCTGTT 

CCTCCACCTAGGGACCTAAGCTGGGAAGCAGGACATGGAGGAAATGTGGCTTTGCCACTT 

AAGGAGACATCGGTGGAAAACTTTCTGGGAGCTACCACACCTGCCCTGGCTAAGCTGCAG 

GTGCCGCGGGAGCAACTCTCTGAAGTTCTGGAGCAGATTTGCAGTGGTAGCTGTGGGCCC 

ATGTTTCGAGCCAATATGAACACTGGGGACCCTTCTAAGCCCAAGAGTGTTATTCTCAAG 

GCTTTAAAAGAACCAGCTGGGCTCCATGAGGTACAAGATTTCTTAGGGCGAATCCAATTC 

CATCAATACCTGGGGAAACACAAAAACCTGGTGCAGCTGGAAGGCTGCTGCACTGAAAAG 

CTGCCACTCTATATGGTGTTGGAGGATGTGGCCCAGGGGGACCTGCTCGGCTTTCTCTGG 

ACCTGTCGGCGGGATGTGATGACTATGGATGGTCTTCTCTATGATCTCACAGAAAAACAA 

GTATATCACATCGGAAAGCAAGTCCTTTTGGCGCTGGAATTCCTGCAGGAGAAGCATTTG 

TTCCATGGGGATGTGGCAGCCAGGAATATTCTGATGCAAAGTGATCTCACTGCTAAGCTC 

TGTGGATTAGGCCTGGCTTATGAAGTTTACACCCGAGGGGCCATCTCCTCTACTCAAACC 

ATACCTCTCAAGTGGCTTGCCCCAGAACGGCTTCTCCTGAGACCTGCTAGCATCAGAGCA 

GATGTCTGGTCTTTTGGGATCCTGCTCTATGAGATGGTGACTCTAGGAGCACCACCGTAT 

CCTGAAGTCCCTCCTACCAGCATCCTAGAGCATCTCCAAAGAAGGAAAATCATG7^AGAGA 

CCCAGTAGCTGCACACATACCATGTACAGTATCATGAAGTCCTGCTGGCGCTGGCGTGAG 

GCTGACCGCCCCTCACCTAGAGAGCTGCGCTTGCGCCTAGAAGCTGCCATTAAAACTGCA 

GATGACGAGGCTGTGTTACAAGTACCAGAGTTGGTGGTACCTGAACTGTATGCAGCTGTG 

GCCGGCATCAGAGTGGAGAGCCTCTTCTACAACTATAGCATGCTTTGAAGAGTCTCGGGC 

AAGAAACATTCATGCATGAGTATATGTTCTTGGAATCAATTCCTCTAAGAACAGAGAATG 

GTCTTTCCCAGGGACACAAAGGGAGAAATGGGACATGGATTCTTGATCTTCCTTTACACA 

TTTCTCGGGAAATCTGAAATGATGCTGGATGGGACTCTACACATCCTGAGCTAAGACATA 

CTGTCAGTCTCACTTCTGCTGTCCCAGTCCTAGAAATCCTGGGTAGAAGTGGTGGACCTG 

TGCAAAGGAGGTTTTAGAACTCTGCAGTATTTGTTGGGGCATGGCACAAATAAGCTCATC 

CCTCCCGTCCGAGGCTAGTTTCCTCTGGAACCACATTTTTATCTAGATGAAAATTTGGAA 

TGAAATGAAGGAATAGAAATCCAATAAAAGAGTTGAAGGGAAAGAAAATTTAAGGTTCTT 

CTTGCTCAGGATTACAGATATGGACCAACACCTCCTTCAAGAAAAGGTGGTAGGAC7iCA/\ 

AGTTCTTCAGTCCTGAGCCCTAC7iTGTGGGGCTGGAGGAGAACTATAACGGAJ\7\AAt"CT i 

TGAGTTTCACCTTAGGTATAGATAAAAGAA7v(3ATGGTCCCCTTTTATCTGATTCTGAGAC 

AGGTAAATTCTGTTTGTTACTACGTTTAATTAGAAGGTGGAGGAGTCATTTCATGATTALA 
GAACATTCAACATGTATTGTTCATTAAGCTAGCTTCCTAGTTCCGATTAGACTAAGGAGA 
CTAAGCCTAGAGAGTCAATGTTAGAACAGTGAAAAGAATTCTGTGTGTGTGTGTGTGTG^' 
GTGTGTGTGTGCACAATAAATAGGAAATGTAGAAACCAAGCAAGAAGGCTTAGTAGCTCA 
GTCTTTAACAAGGGCTAGAAAAGAATGTAATCTGATATGGAAGGATAGCAGCTTCTAATT 
TTCAATCATCTGTTGATATACTGTGAAAGTTATTTTATTAAATTAATATTTATTAAATGG 

SEQ ID NO: 117_AA098024_M 

CTGCAGGAGAAGCACCTGTTTCATGGGGATGTGGCTGCCAGGAACATCCTGATCCAAAGT 
GACCTGACTCCCAAACTTTGTCATCTGGGCCTGGCTTATGAAGTTCATGCCCATGGGGCC 
ATCTCCTCTGGTCGATCCAGCACCATCCCTGTCAAGTGGCTTGGTCCAGAAAGGCTTCTC 
CTGAGACCTGCAAGCATCAGGGGAGATATTTGGTCGTTTGGGATCCTGGTTTATGAGATG 
GTGACTCTAGGAGCAGCACCATACCCTGAAGTCGGTCCCACCAGCATCCTACAATATCTT 
CAGAGAAAGAAAATCATGAAGAGACCCAGCAGCTGCTCACATGCCATGTACAACATCATG 
AAGTGCTGTTGGCGCTGGAGTGAGGACAGCCGGCCCTTACTTGTTCAGCTGCTCCAGCGC 
CTAGAAGCTGCTTCTAGATCTGCCGATGACAAGGCTGTGTTGCAAGTGCCAGAGTTGGTG 
GTGGGTGAACTGTATGCAGATGTGGCTGGCATCAGGGCAGAAAGCATTTCCTATAGCTTC 
AGTGTCCTTTGAAGATGGTCCTAGACAAATGACTATATATGGGTGGAATTAGTTCCTTGA 
AGAACAGAGAGAAGGAACTTTCTGTGGCCCACCAAGGGAGAAAAAAGGACATGGATCTTG 
CATCTTTCCCTAAACATTTTCCTAGACATCTGAAATGCTGCTGGATGAAGCTCTACCTCT 
ACATACGATGTACTGTTGAGCTAAGAATCACCATCAATTGTAGTTTGGTTTGCAGTGCCA 
AGGGCTGAAGTATAAGTGGTGGACCGTGTCATTCTAAAGGAGGTTTTTAAAATCTGCAAT 
GATTGTAAGGGAATTAGGCAAAAGGGCTGGTCCCACTCACTCCAGGGTGGTTTACTACTG 
AAACTAGTTTTTCTTTTGTTTTTTTTTAAGTTAAACTATTACAGAGTAAAAATAAACCAG 
ATGGGGATGAATGAACACCTTCTAATTTTTAAGCATGAATTGAATATTGGAATTGATGAG 
AAAGAAAATTCTAGGTTCTTTTTGCTAAGAGGTGTTAAGGTGAGTCAATATATCCTTCAA 
GGAAAGGCTTTGTCTCATCTATGTTGAGGGGACGTAAAAGTCCTGGTCCCGTTATGAAGA 
GCTGAAGAAGATCTATAAGAAACAATACTGAGCCTTTCCTTGACTATAGATAGAAGAGCA 
TCCTTTCATTGAACTCTGAGGCAGGTGGACCATGCATGATACTAAGTTTAATTAGAAGCA 
GGAGGAGTCATTTCATGATTAGGAACATTGTTCATCCCATTTGTTTGCCAGTTCCTGTAA 
GACTAAGGAGAATCAGCCTATAGAGCCAAAGCTAGAACCAGGGATAAAAAGTGTGTGTGT 
GTATAACAAATAGGAAGCATGAAAGTCGAGCAAGAAGACTTAGTAACCCAGGTGGTCATT 
AAGAGGTACAGAGAAGAAGTAATCTTATAGGAATGGATGGTAGCTTCTAATTTTTAACCA 
TTCATTGAAATAACTGTGAAGCAACTCATTAAACTAGTATTTATTGACCAAAAGTAGACT 
TTTCAGGTGTATAGCTGCCAAAATCTCTATAATAAAGAGGCTAAAAGAAAATAAATGGGA 
GTTATTTTACTAGGAAAATTAGAGAACCTATAGTTTCCAAAAAGAGATTCTTTATGTGCA 
AAATGAGATAACTCTCTACCTCACAGGGTTGGTGTGAGGAACAATGAGAATATGTATTTG 
TGTATTATGTAGAATATAATATATTCTCAATAAATACTAGTTTTTCCCCTTTC 

SEQ ID NO: 118_SGK2 ALPHA_H 

GAAGAGGGCAGAGCCGTGCATGGGGCTGCTCCCCAGGACCTGAGCAGGAACCTGGAGTTT 
TCAGAGCTGCCTGATCATTGCTACAGAATGAACTCTAGCCCAGCTGGGACCCCAAGTCCA 
CAGCCCTCCAGGGCCAATGGGAACATCAACCTGGGGCCTTCAGCCAACCCAAATGCCCAG 
CCCACGGACTTCGACTTCCTCAAAGTCATCGGC7VAAGGGAACTACGGGAAGGTCCTACTG 
GCCAAGCGCAAGTCTGATGGGGCGTTCTATGCAGTGAAGGTACTACAGAAAAAGTCCATC 
TTAAAGAAGAAAGAGCAGAGCCACATCATGGCAGAGCGCAGTGTGCTTCTGAAGAACGTG 
CGGCACCCCTTCCTCGTGGGCCTGCGCTACTCCTTCCAGACACCTGAGAAGCTCTACTTC 
GTGCTCGACTATGTCAACGGGGGAGAGCTCTTCTTCCAGCTGCAGCGGGAGCGCCGGTTC 
CTGGAGCCCCGGGCCAGGTTCTACGCTGCTGAGGTGGCCAGCGCCATTGGCTACCTGCAC 
TCCCTCAACATCATTTACAGGGATCTGAAACCAGAGAACATTCTCTTGGACTGCCAGGGA 
CACGTGGTGCTGACGGATTTTGGCCTCTGCAAGGAAGGTGTAGAGCCTGAAGACACCACA 
TCCACATTCTGTGGTACCCCTGAGTACTTGGCACCTGAAGTGCTTCGGAAAGAGCCTTAT 
GATCGAGCAGTGGACTGGTGGTGCTTGGGGGCAGTCCTCTACGAGATGCTCCATGGCCTG 
CCGCCCTTCTACAGCCAAGATGTATCCCAGATGTATGAGAACATTCTGCACCAGCCGCTA 
CAGATCCCCGGAGGCCGGACAGTGGCCGCCTGTGACCTCCTGCAAAGCCTTCTCCACAAG 
GACCAGAGGCAGCGGCTGGGCTCCAAAGCAGACTTTCTTGAGATTAAGAACCATGTATTC 
TTCAGCCCCATAAACTGGGATGACCTGTACCACAAGAGGCTAACTCCACCCTTCAACCCA 
AATGTGACAGGACCTGCTGACTTGAAGCATTTTGACCCAGAGTTCACCCAGGAAGCTGTG 
TCCAAGTCCATTGGCTGTACCCCTGACACTGTGGCCAGCAGCTCTGGGGCCTCAAGTGCA 
TTCCTGGGATTTTGTTATGGGGCAGAGGATGATGACATCTTGGATTGGTAGAAGAGAAGG 
ACCTGTGAAACTAGTGAGGCGAGCTGGTATTAGTAAGGAATTAGCTTGAGGTGGTAGGAA 
GAGCGACTCAAAGTAACAATGGGTTCAAGGAGAAGCAGGTTTATTTTTTCCAGCAGATAA 
AAGAAAAATAATGTTTCGGAGTCCAGGACTGGCAGGACAGGTCATCAGATACTCAGAGGC 
TGTATCTCTGCGCTGCCAACCTTGACAAATGGCTTCCAATGTTAGGTTTGCTACT^AGATG 
GTTACTGGAGGTGTAGCTGGGTATTTTGTGTTTAGGGAAGGGAAAATGGAGGAAAGGGGA 
GAAGAGGAAAGGGGGGTTTTAAAGAGCTTTCGCAAAAGGTGCGGCCAATGACTTTTGGTT 
CCATCTCACTAACCACCCACCCCTACCTGGAATGGAGGCTGGGAAATGTGGCTTATTTGC 
TGGGTAGGTGACTATCCCTAATAACAAAGGGGTTTTGACGCTAAGACATTAGGGGAGAAT 
GTTGGGTAGGCAGCCAGCCCTCTTTTACGATAGGGCGTCCTGGTGTTTGGATTTTGATCT 
CAATGTGTAAAATGACAGAGATGTAAGAAGGTCATAGGGTATGAATATCTCTTATTGTTG 
TATGTTGAAAAA 

SEQ ID NO: 120_CCRK_H 

ATGGACCAGTAGTGCATCGTGGGCCGGATCGGGGAGGGCGCCCACGGCATCGTCTTCAAG 
GCCAAGCAGGTGGAGAGTGGCGAGATAATTGCGCTCAAGAAGGTGGGCCTAAGGCGGTTG 
GAAGACGGGTTCCCTAAGCAGGCGCTGCGGGAGATTAAGGCTCTGCAGGAGATGGAGGAG 
AATCAGTATGTGGTACAACTGAAGGCTGTGTTCCCACACGGTGGAGGGTTTGTGCTGGCG 
TTTGAGTTCATGGTGTGGGATGTGGCCGAGGTGGTGCGGCATGCCCAGAGGGCACTAGCC 
CAGGCACAGGTCAAGAGCTACCTGCAGATGCTGCTCAAGGGTGTCGCCTTCTGCCATGCC 
AACAACATTGTACATCGGGACGTGAAACGTGCCAAGGTGGTGATCAGGGGGTCAGGCCAG 
CTCAAGATAGCGGACTTTGGCCTGGCTCGAGTCTTTTCCCCAGACGGCAGCCGCCTCTAC 
ACACACCAGGTGGCCACCAGGTGTGTGGGGTGCATCATGGGGGAGCTGTTGAATGGGTCG 
CCCCTTTTCCCGGGCAAGAACGATATTGAACAGCTTTGCTATGTGCTTCGCATCTTGGGC 
ACCCCAAACCCTCAAGTCTGGCCGGAGCTCACTGAGCTGCCGGACfACAACAAGATCTCC 
TTTAAGGAGCAGGTGCGGATGCCCCTGGAGGAGGTGCTGCCTGACGTCTCTCCCCAGGCA 
TTGGATCTGCTGGGTCAATTCCTTCTCTACCCTCCTCACCAGGGCATCGCAGGTTCCAAG 
GGTCTCGTCCATCAGTACTTCTTCACAGCTGCCCTGCCTGCCCATCCATGTGAGGTGGCG 
ATTGGTCAGCGTCTAGGGGGACGTGGCCCCAAGGCCCATCCAGGGCCGCGCCACATCCAT 
GACTTCCACGTGGACCGGCCTCTTGAGGGAGTCGCTGTTGAACCCAGAGCTGATTCGGCC 
CTTCATCCTGGAGGGGTGAGAAGTTGGGCGTGGTCCCGTCTGCCTGCTCCTCAGGACCAC 
TCAGTCCACCTGTTCCTCTGCCACCTGCGTGGCTTCACCCTCCAAGGGGTCCCCATGGCG 
ACAGTGGGGCCACACCACAGGTTGCCCCTTAGCCCTTGGGAGGGTTGGTGTCGAGGCAGA 
GGTCATGTTCCCAGCCAAGAGTATGAGAACATCCAGTCGAGCAGAGGAGATTCATGGCCT 
GTGGTCGGTGAGCCTTACGTTGTGTGTGCTACTGAGGTAGGCATCAGGACAGTGAGGTGT 
GCTGCCAGTCAAGGCCTGCATATGCAGAATGACGATGCCTGCCTTGGTGCTGCTTCCCCC 
GAGTGGTGGGTCGTGGTCAAGGAGAAGTGCAGAGAGTAA 

SEQ ID NO: 121_TESK2_H 

GAATTCGCGGCCGCTCGACGCTCAGCAGAGCTACCAGCTGCCCTGTTGGCTTCGCTGGTG 
GGATCGTCCTCCTGGCCCCGCCAAACAGGCGAGCGGCCCCGACTGTGGGGCATGGGAGTA 
GTCTCCTCGTTCTC^CGCCGCCGCTAGCCTAGCTGAGTCGCCGGCTTCTfjCGCTAGGGGCT 
CCCACCGCCTCCGCAGGCTAAGGAGCCGCTGCCACCAACGAGCTGTGAGGGTTACTATG' 

TCCCTCTTTGCCGCCGTCTCCTCCTCTTGCCCGCGCAGGCACCCCTCTGGCTGCTCAGl'C 

CTGCCTCAGTGTCAAACCAGAAGAGAAGTAAAATTCAACAAAAATTTATGTGTGGAGTTi ' 

CTTCTTAAAAGAAGAAAAAAGTGATTATTTAGACTATGGATCGGAGCAAACGGAATTCAA 

TTGCAGGATTTCCTCCACGTGTGGAGCGTCTTGAAGAGTTTGAAGGAGGTGGTGGAGGAG 

AAGGAAATGTGAGCCAGGTGGGAAGAGTTTGGCCATCTTCGTATCGAGCTCTTATAAGTG 

CCTTTTCCAGACTGACGCGTTTGGATGATTTCACCTGTGAAAAAATAGGGTCTGGCTTCT 

TTTCTGAAGTGTTCAAGGTACGACACCGAGCTTCTGGTCAGGTGATGGCTCTTAAGATGA 

ACACATTGAGCAGTAACCGGGCAAACATGCTGAAAGAAGTACAGCTCATGAATAGACTCT 

CCCATCCCAACATCCTTAGGTATATCAACTCCGGGAACCTGGAACAGTTGCTAGACAGTA 

ACCTGCATTTGCCTTGGACTGTGAGGGTAAAACTGGCCTATGACATAGCAGTGGGCCTCA 

GCTACCTTCACTTCAAAGGCATTTTTCATCGGGACCTCACATCTAAGAACTGCCTGATAA 

AGAGGGATGAGAATGGTTACTCTGCAGTGGTAGCTGACTTTGGCCTGGCTGAGAAGATCC 

CCGATGTCAGCATGGGGAGTGAGAAGCTGGCCGTGGTGGGTTCCCCATTCTGGATGGCAC 

CTGAGGTTCTCCGAGATGAGCCCTATAATGAAAAGGCAGATGTGTTCTCTTATGGTATCA 

TCCTCTGCGAGATCATCGCCCGCATCCAGGCCGATCCGGACTATCTTCCCCGCACAGAGA 

ATTTCGGGCTGGACTATGATGCTTTCCAGCACATGGTGGGAGACTGTCCCCCAGATTTTC 

TGCAACTTACTTTCAACTGCTGTAACATGGATCCCAAACTGCGCCCATCTTTTGTGGAGA 

TTGGGAAGACCCTGGAGGAAATTCTGAGCCGCCTACAGGAAGAAGAGCAGGAGAGGGATA 

GGAAGCTGCAGCCCACAGCCAGGGGACTCTTGGAGAAAGCACCTGGGGTGAAGCGACTAA 

GCTCACTGGATGACAAGATCCCCCACAAGTCACCATGCCCAAGACGTACCATCTGGCTGT 

CTCGAAGCCAGTCAGATATCTTTTCCCGTAAGCCCCCACGTACAGTGAGTGTCTTGGACC 

CATACTACCGGCCACGAGATGGTGCTGCCCGCACCCCCAAAGTCAACCCTTTTAGTGCTC 

GCCAGGACCTCATGGGGGGCAAGATCAAGTTTTTTGACCTGCCCAGCAAGTCTGTCATCT 

CTCTGGTATTTGACCTGGATGCACCAGGGCCCGGAACTATGCCCCTGGCTGACTGGCAGG 

AGCCCCTGGCCCCACCTATTCGCCGGTGGCGTTCCTTGCCTGGTTCGCCTGAGTTCTTGC 

ATCAAGAGGCTTGTCCATTTGTGGGCCGGGAAGAATCGCTATCTGATGGGCCCCCACCAC 

GCCTAAGTAGTCTCAAGTACAGAGTTAAAGAGATCCCACCATTCCGGGCATCTGCCCTAC 

CAGCTGCTCAAGCCCATGAGGCTATGGACTGCTCCATTCTCCAGGAAGAAAATGGTTTTG 

GGTCCAGGCCCCAGGGGACCAGTCCATGCCCTGCGGGTGCTTCTGAGGAGATGGAGGTAG 

AAGAAAGGCCAGCAGGCTCAACTCCAGCCACCTTCTCCACCTCAGGCATAGGCCTGCAAA 

CCCAGGGAAAGCAGGATGGGTGAGGGGGTTTAGTCCCTGCCTCACCTTGGGGATGGACCT 

TCAGCTGAAACCATATGGCCCCCTAGGTGCACAGCCTTGATTCTTCCCTGGAGCCTACAG 

AGCAGGCAGGCTAGGCCAAGCCAGGCTCAACTTCTGGGCTCCCAGTGCCCATTGGCTGTG 

TATGACGGGAGGCAGCAGTGAGAGGCCTTCCTAGTTAGGGCCAACAGCTGATACCAAGCC 

TCTGAAATCCAGCAAGGAGGTCTGCCTCCCACCAGACCCTCTCCAGTGTACTTCCCCAGA 

TAGGACCAGAGGATGTCTAGTTCTAGGCTGAGCTGGCAGGCAGCTATTACCCCGGTTCTT 

TCCCCACCCCAGGTCTGTCTCTTGCCTTTTCTTGGGGCATATAAGCTACTGAGTGGAACA 

TGGAGCTGATCAAGAGGCCGTAATGGTCATGGCTGTTTCCAGACCTGAATATTGGGTGCT 

TCTTGCCAGTATTCTAAGACATTTGAGTAATTGCTGTTTGCACTTACTGCATGGTCAGAC 

CACGTCACTACATTTCTATGCAAGGGGACAGCAAGGCAGCGTGGTGGTCATGGCTCTTAG 

CTAACCTATTCAAAGACCTTTTCCTGTTGATTAATCTATTTTCATATTTATAAAGGAGTC 

TTAATGTTCTGCCCCATAAGACTTTCAACCTTGTGGTTGGGAGTGGGGCTGGTTTTGTAG 

GCCCTAGGGCCTGCTTCTATGTATTTATCAACATGTGATACATTCAATTGGTTAAATGGT 

TTATACAGGGACTGATTTGCTTCCCTTCCTGCCATGGCTGGAGCTTTGGGAACAGTCTGT 

CCTTACAGAGCTGCAATAAGAAATAACCAAAGATGAAGCTGGTCAAATATTTTCATAACT 

TGCTTCTGTTGATTTTTTTTTTGTAAAACTTTCCCAAGACATTTTCAGACTTAAAAATAA 
AGTCAGTGTTACAGGT 



